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ABSTRACT 


A characterization of quantum measurements by operator-valued measures is 
presented. The ' generalized' measurements include simultaneous approximate mea- 
surement of noncommuting observables. This characterization is suitable for solving 
problems in quantum communication. 

Two realizations of such measurements are discussed. The first is by adj'oining 
an apparatus to the system under observation and performing a measurement cor- 
responding to a self-adjoint operator in the tensor -product Hilbert space of the system 
and apparatus spaces. The second realization is by performing, on the system alone, 
sequential measurements that correspond to self-adjoint operators, basing the choice 
of each measurement on the outcomes of previous measurements. 

Simultaneous generalized measurements are foimd to be equivalent to a single 
•finer grain’ generalized measurement, and hence it is sufficient to consider the set 
of single measurements. 

An alternative characterization of generalized measurement is proposed. It is 
shown to be equivalent to the characterization by operator -valued measures, but it is 
potentially more suitable for the treatment of estimation problems. 

Finally, a study of the interaction between the information-carrying system and a 
measuring apparatus provides clues for the physical realizations of abstractly char- 
acterized quantum measurements. 
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V 


Part I. Characterization of Quantum Measurements 


I. GENERAL INTRODUCTION AND SUMMARY OF PART I 
1. 1 Motivation for This Research 

Recent developments in coherent and incoherent light sources, optical processors, 
detectors, and optical fibers have sparked wide interest in optical communication sys- 
tems and optical radars. At optical frequencies quantum effects can be very significant 
in the detection of signals. In fact, there are many cases where quantum noise com- 
pletely dominates other noise sources in limiting the performance of optical systems. 

It is essential to have a good understanding of the properties of qu mtum measurements 
in order to design and evaluate quantum optical systems. We present a characteriza- 
tion of quantum measurements which communication engineers will find convenient to 
use. The study of the interaction between the information-carrying system and a mea- 
suring apparatus provides a suggestion for the physical realization of abstractly charac- 
terized quantum measurements. 

1 . z Characterization of Quantum Measurements 

It is a general assumption in quantum mechanics that a measurement on a quantum 
system is characterized by a self-adjoint operator, also known as an observable. Usu- 
ally the Hilbert space in which this self-adjoint operator acts is not well defined and 
sometimes it is not even mentioned. Frequently it is assumed that the Hilbert space is 
the one that includes all (but only) the accessible states of the system. That is. it is 
possible to put the system in any given state in this Hilbert space. Occasionally we can 
make use of a priori knowledge of how the quantum system has been prepared, and spec- 
ify the Hilbert space as 'the one that is spanned by the set of states that occur with non- 
zero a priori probabilities. Rarely is the Hilbert space considered as ar^ one that 
includes the set of accessible states as a proper subspace. It is only with such a defi- 
nition of the Hilbert space that every measurement is characterized by a self-adjoint 
operator. This definition of the space is often unacceptable, however, because we are 
seldom sure how big the Hilbert space has to be befc re a particular measurement can 
be characterized by a self-adjoint operator within the space. It is particularly clumsy 
for the communication engineer when he tries to find the optima measurement by opti- 
mizing over a set of such loosely and poorly defined measureir.e its. Therefore the com- 
munication engineer is interested in characterizing the set of all quantum measurements 
by operators acting in more well-defined Hilbert spaces, such as the space spanned 
by all accessible states or by the set of states with nonzero a priori probabilities. When 
defined on such spaces, not every measurement can be characterized by a self-adjomt 
operator. For example, Louisell and Gordon,^ and recently Helstrom and Kennedy 
and Holevo^ have noted that if the system under observation is adjoined with an appa- 
ratus, and a subsequent measurement is performed on both systems, the scope of 
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measurement can be extended to at least simultaneous approximate measurements of 
noncommuting observables. This particular type of measurement is important because 
it has been shown^ that minimum Bayes cost in communication problems may sometimes 
be achieved by such measurements. It has been suggested that the characterization 
of quantum measurements by operator-valued measurements is appropriate for quantum 
communication. Yuen‘S and Holevo^ have derived necessary and sufficient conditions on 
the operator-valued measures for optimal performances in detection problems. It seems 
that this characterization of measurement is useful at least in calculating optimal 
performances of quantum receivers. But such an essentially abstract mathematical 
characterization does not suggest how the measurement can be realized physically. 
Furthermore, it does not explain what happens to the system as a result of the measure- 
ment. This is in contradiction to the self-adjoint observable view of quantum measure- 
ment, where the observable can be expressed as a function of a set of generalized 
coordinates of the system and one can see what coordinates of the system the measure- 
ment should measure in some fashion. The von Neumann projection postulate gives the 
final state of a system after a self-adjoint measurement. So there are nice properties 
about a self-adjoint observable that are better than the operator-valued measure 
approach, particularly when the interest is in physical realization of quantum meafure- 
ments. An observable is usually considered to be physically measurable, in principle 
at least, while there has been no indication that any measurement characterized by an 
operator -valued measure can be measurable, even in principle. But it is very important 
for a communication engineer to optimize his receiver performances on a set of 
measurements that is at least physically implementable in principle. Recently Holevo 
has noted that for every operator-valued measure, one can always find an adjoining 
apparatus and a self-adjoint observable on the composite system such that the measure- 
ment statistics will be the same as those given by the operator -valued measure. In 
Part I, given the operator -valued measure, we show how the apparatus Hilbert space 
can be found and what the corresponding observable is. This constructive procedure 
we call our 'first realization of generalized measurements.' 

The method described here is not the only way to realize a generalized measure- 
ment. If we consider a sequence of self-adjoint measurements performed on the system 
alone, the statistics of the outcome sometimes correspond to those given by an operator- 
valued measure. We call this our 'second realization.' 

Since considerations of simultEineous measurement of noncommuting observables 
lead to the operator-valued measure characterization, we shall consider the simulta- 
neous measurement of two or more measurements characterized by operator-valued 
measures. 

Finally, we propose an alternative (but equivalent) characterization of generalized 
measurements. This characterization is potentially useful in considering estimation 
problems. 


1 . 3 Summary of Part I 


We address the mathematical problem of the extension of operator -valued measures 
to projector-valued measure on an extended space in Sections III and IV, (The results 
arc used in the proofs of theorems in subsequent sections. For a general appreciation 
of the results of this report, Section IV may be skipped.) The first realization of 
generalized measurement by adjoining an apparatus is described in Section V. Several 
properties of the extended space and the resulting measure are discussed in Section VI. 

In Section VII the dimensionality results are used to determine the dimensionality of the 
apparatus Hilbert space which is required for the first realization. . These results are 
also used in the second realization of several classes of generalized measurements by 
sequential measurements, which is developed in Sections VIII and IX with the main 
results given in Section X. Although not every operator-valued measure corresponds 
to a sequential measurement, in Sections XI and XII we have been able to show that a 
large class of measurements in quantum communication can be realized by sequential 
measurements with the same or arbitrarily close performances. In Section XIII we 
show that a simultaneous measurement of two or more generalized measurements corre- 
sponds to a single generalized measurement; hence, consideration of such measure- 
ments will not give improved performance. 

An alternative characterization of generalized measUi'enients is offered in Sec- 
tion XIV. 

1.4 Relation to Previous Work 

Holevo suggested^ the realization by adjoining an apparatus when he noted that 
NaYmark's theorem provides an extension of operator- valued measures to projector- 
valued measures on an extended space. The method of embedding the extended space 
in the tensor product space of the system and apparatus was found by the author, 

P, A, Benioff was working in the area of sequential measurements, at the same 
time that I was doing the thesis research for this report. His characterization of 
sequential measurement is similar to that given in Part I, Section VIII. 

Although self-adjoint observables in principle can be measured, very few of them 
correspond to known implementable measurements. In Part II, by means of an inter- 
action between the system under observation and an apparatus, we shall show how the 
relevant information may be transformed in such a way that by measuring a measurable 
observable we can obtain the same outcome statistics of the abstractly characterized 
measurement. The type of transformation that is required and the means of finding the 
required interaction Hamiltonian are shown. Inferences are drawn about which coordi- 
nates of the system and apparatus should be coupled together, and in what fashion. The 
constraints of physical law on the 'allowable' set of interactions are discussed. 


3 






II. GENERALIZATION OF QUANTUM MEASUREMENTS 


In quantum mechanics it is generally assumed that em observable of a quantum sys- 
tem is characterized by a self-adjoint operator defined on the Hilbert space describing 
the state of the system. Let us ccill this operator K, and assume that it has a complete 
set of orthonormal eigenvectors { I associated with distinct eigenvalues 
where J is some countable index set, and 


K|kj> = kjkj). 


0 ) 


Each commuting and orthogonal projection operator {ll^ s |k^) projects an 

arbitrary vector of the Hilbert space into the subspace spanned by | kj^) and together they 
form a complete resolution of the identity; that is, 


s n . = I, 

iey ^ 


( 2 ) 


where I is the identity operator. 

When the measurement characterized by the operator K is performed, one of the 
eigenvalues k. will bo the outcome, and the probability of getting k. is 


p(k^) = <s|nj|s), (3) 

if the system is described by a pure state | s), or 

P(kj) = Tr{pgn.}, (4) 


if the system is described by the density operator p . 

s 

This formulation of the measurement problem does not include all possible measure- 
ments. For example, it does not encompass a simultaneous measurement of noncom- 
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muting observables. Louisell and Gordon and recently Helstrom and Kennedy and 
Holevo have noted that if the system S is made to interact with an apparatus A and 
subsequent measurements performed on S+A or A alone, the scope of mea.«Jiirement can 
be extended to at least simultaneous approximate measurements of noncommuting 
observables of S. In particular, we can perform measurements corresponding to a set 

defined on UCg, the 
system Hilbert space, which forms a resolution of the identity in 3Cg. 

S Q. = I. (5) 

ie3C ^ 

T'' illustrate this possibility, we consider the interaction of the system S with an 
apparatus A. Before interaction the joint state of S+A can be represented by the density 


of noncommuting, nonorthogonal, self-adjoint operators 
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operator 


P^A = PS° ® Pa 

defined on the Tensor Product Hilbert Space 3Cg ® 3C^ = ^Sf A' ® denotes tensor 

product. The result of the interaction is a unitary transformation on the joint state. At 
any arbitrary time t later than t^, the density operator of the combined system and appa- 
ratus is 



where U(t, t^) is the unitary transformation. 

Let {nj(t)}jgy be a set of commuting, orthogonal projectors in 3Cg ® 3C^ at the 
time t. If we perform a measurement characterized by the the probability of getting 
the eigenvalue corresponding to the subspace into which 11^ projects is 

P(kj) = Tr{p*g^^nj(t)}. (8) 

Let 

ni(to) = u"^(t,to)ni(t) U(t.to>. (9) 

The {n j(t^)}.gy again form a commuting, orthogonal, projector-valued resolution of 
the identity in 3Cg ® 3C^, and 


t t 

P(kp = Tr{pg° ® 

Defining 

Q^ = Tr^{pl°nj(t^)}, 

where Tr^ indicates taking partial trace over we obtain 


( 10 ) 


( 11 ) 


P(k.) = Trg{p^gQ.}, (IE) 

where Trg indicates taking trace over JCg. 

The set is again a resolution of the identity but in general the are not 

orthogonal nor commuting; furthermore, they only have to be nonnegative-definite self- 
adjoint operators. It can be shown that if the are projectors it is necessary and 
sufficient that they be orthogonal (see Appendix A for a statement of the theorem that 
is due to Halmos). This particular form of measurement is important because it has 
been shown® that minimum Bayes cost in communication problems may sometimes be 
achieved by such measurements. 
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III. THEORY OF GENERALIZED QUANTUM MEASUREMENTS 


We shall now specify a generalized theory of quantum measurements that does not 
correspond necessarily to measurements characterized by self-adjoint operators on 
the Hilbert space that describes the system under observation. 

As we have noted, an observable is characterized by a self-adjoint operator K that 
possesses a set of orthogonal projection operators {II J such that Z IIj = I. The set of 

projection operators is said to form a commuting resolution of the identity , and defines 

a proje ctor-valued measure on the index set {i}. 

This characterization of quantum measurements does not conveniently take into 
account the simulteuieous approximate measurement of noncommuting observables, and 
it is necessary to consider more generalized measurements characterized by 'gene£- 
alized' resolutions of the identity . {See refs. 9-11 for more detailed motivation and 
discussion.) 

The requirement that the II. be projection operators is relaxed by replacing the Ilj 
with nonnegative -definite operators Q. having norms < 1, so that Z = I. Now the 'mea- 
surement operators' Q. no longer have to pairwise commute, nor are they orthogonal 
to each other in general. The Q. then define an operator -valued measure on the 
index i. 

Sometimes the resolution of the identity does not have to be defined on countable 
index sets sqch as the integers. For example, the index set can be the whole real line. 

We shall now discuss more general definitions of resolutions of the identity. Some of 
the terminology will be required for the discussion of estimation problems, although the 
foregoing is generally adequate for detection problems 

DEFINITION 1. A resolution of the identity is a one-parametex’ family of projections 

■fE. } ^ . which satisfies the following conditions: 

_00<\<+00 

®\^|x “ ®min{\, |x) 


(ii) 

(iii) 


^-00 ” ^+00 “ ^ 


®\+0 " 


where 


E , _x = lim E.x 

±00 . , \ 

\-*±00 




with X being an element in the space 3C. 


(13) 
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Such a family of operators defines a projector -valued measure on the real line (R, 

For an interval A = where Xj < X 2 , the measure E(A) = is a projec- 

2 1 

tion operator. It follows from condition (i) that for two disjoint intervals A^ on tlie 
real line 

E(Aj) EiA^) = 0. (14) 


In fact, this orthogonal relation is true for two arbitrary disjoint subsets of thf* real line 
(see Appendix A). In this sense the resolution of the identity Ej^ is also called an orthog - 
onal resolution of the identity . 

For a small differential element dX, the corresponding measure is dEj^ = E(dX) = 
\fdx - ^X* 


The integral 

100 


r-tOO 

J-00 


X dE^ 


(15) 


converges in strong operator topology, and defines a self-adjoint operator in the Hilbert 
space 3C. Conversely, by t'le spectral theorem for self-adjoint operators (see Appen- 
dix B), every self-adjoint operator possesses such integral representation. The 
family is called the spectral family for the operator A, 

Sometimes the projector-valued measure is defined on a finite number of discrete 
points (for example, the points may be the integers i = 1, . , . , M), and it is often more 
convenient to write the measure corresponding to each point i explicitly. The mea- 
sures {llj^} are projection operators and they sum to the identity operator 

S n = I. (16) 

i 

The orthogonality condition in Eq. 13 becomes 


n.n. = 6. .n., 

I J IJ J 


(17) 


where 6. . is the Kronecker delta, 


6 = P 

' ‘'ij \0 i^j- 


To reconstruct the resolution of the identity given in the definition, we define 

E. = EH. 
i^X ^ 

and {E, } will have all desired properties of a resolution of the identity. 


(18) 


EXVMPLE 1 

If a self-adjoint operator A has a set of eigenvectors {| that forms a com- 

plete orthonormal basis for the Hilbert space 3C, then A c£in be written 
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{ 19 ) 


M 

A = S aJa^Xa-l. 
i=l 

where the are the real eigenvalues of A. 

The aet of projection operators 

n.= |ai><aj (20) 


forms a projector-valued measure on the integers, i = I, . . . , M, and they sum to the 
identity operator 


M 

s n. = I. 
i=i ^ 


(21) 


DEFINITION 2. A generalized resolution of the identity is a one-parameter family 


of operators {f. }_ 


\J-W <\<+00 


that satisfy the following conditions: 


(1) If Fv -F. is a bounded nonnegative-definite operator (which also 

implies that it is self-adjoint) 

(iii) F_„ = 0, F^^ = I. . (22) 


Such a i'amily of operators defines an operator-valued measure on the real line. For 
example, " we have an interval A = (XpX.^]^ where < \ 2 * measure is F(A) = 

F. “ F. ■ For a small differential element d\, the corresponding measure is 

^2 ^1 00 
dFj^ = F(d\) = - F^. Whenever the integral A = /_„ XdFj^ converges in strong 

operator topology, it defines a symmetric operator A in the Hilbert space 3C (i. e. , its 

domain is dense in 3C; and for f, g G D^, (Af,g) = (f, Ag)) and the family {F^^} is 

called the generalized spectral family for the operator A. 

A projector-valued measure is a special type of operator -valued measure, but 
operator-valued measures are more general in the sense that the measures are 
nonnegative-definite self-adjoint operators instead of being restricted to projection 
operators, as in projector-valued measures. One of the consequences of this defini- 
tion of measure is that the measures of two disjoint subsets of the index set do not have 
to be orthogonal as in projector -valued measures. 

EXAMPLE 2 

An example of an operat r-valued measure that is not a projector-valued measure 
is when {EJ^}, {Ej^} are two projector-valued measures that do not commute for at least 
one value of X, and we form the generalized resolution of the identity 

Fj^ = aE'^ + (l-o)E^, 


(23) 



where a is a real parameter in the interval {0, 1). Specifically, defines an operator- 
valued measure, but not a projector-valued measure, on the real line. 

As in a projector- valued measure, sometimes an operator -valued measure is defined 
on a finite number of discrete points (for example, the points may be the integers, 
i = 1, . . . , M) and it is more convenient to write the measure corresponding to each 
point i explicitly. The measures are nonnegative -definite self-adjoint operators with 
norm =S1. To reconstruct the resolution of the identity given in the definition, we 
define 


F. = £ Q., 


(24) 


and {Fj^} has all of the desired properties of a resolution of the identity. 
EXAMPLE 3 

Figure 1 shows three vectors | s^^), i = 1, Z, 3 with the symmetry 

<®il®j) j. 

We define 


<^i=T l^iX^il- i= 1.2.3. 


Then 


-J 

i=l 


Q. = I 


(25) 


( 2 ">) 


(27) 


and 


Qi Qj. 


(28) 


Thus is an operator-valued measure but not a projector-valued measure on 

the space spanned by the { j s^) }. The operator-valued measure {Q.} above is defined 

on the real line <R. We can also define operator- 
valued measures on general measurable spaces. 

If (X, ja/) is a measureable space, where X 
is the space, and s/ a collection of subsets of X 
on which an appropriate measure can be defined 
(for example, can be a o- -algebra, v-ring, 

0 - -field, etc.), a map F(.) can be defined as 
follows. 

For all subsets A e jj/, A F(A), where 
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IHlIlitEi 




(i) F(A) is a bounded nonnegative -definite self-adjoint operator. 

(ii) The map F( . ) is countably additive, i. e. , for any countable number of pairwise 
disjoint subsets in si , {Aj^}, isay. 


F(U A.) = ^ F(A.). 


(iii) F{X) = I, the identity operator in 3C, so F( . ) is a resolution of the identity. 

(iv) For the null set 0 , F(0) = 0. / 

EXAMPLE 4 

1 2 

The output of a laser well above threshold is in a coherent state . A coherent state 
I a) is labeled by a complex number a, where the modulus corresponds to the amplitude 
of the output field, and the phase of o corresponds to the phase of the field. The inner 
product between two coherent states jo), jp) is given by 


(o|(5> = exp |o*p - I joj^ - ^ 


The coherent states can be expressed as a linear combination of the photon states 
I n), n = 0, 1,,. . . where the integer n indicates the number of photons in the field 

n=,0 

The Hilbert space 5C that describes the field is spanned by the set of photon states 

{l">}n=0 

2 ln><nl=l3c- 

n=0 

If we define 

{nn = |n><n|}^,0. 

then the set of projectors {ll^} is a projector-valued measure defined on the positive 
integers of the real line. 

The set of coherent states also spans 3C, and the integral 
r jo) (o| d^o = Ijg, 

where C is the complex plane, and d^o = dlm(o) dRe(o). If we define 






{Qa = l“> <“i}aeC* 


(35) 


we have an operator -valued measure {Q^} defined on the complex plane C instead of on 
the real line and 

Q Q , :jt Q 6 , (^^) 

o' aa' 

so it is not sin orthogonal resolution of the identityt 

A measurement on a physical system can be characterized by an operator-valued 
measure, with the outcome of the measurement having values in (or labeled by elements 
in) X. The probability of the outcome falling within a subset A e ja/, is given by 
Tr {pF(A)}, where p is the density operator for the system under observation. When 
a measurement is characterized by a single self-adjoint operator, sometimes called an 
observable, the measures are all projector-valued. Here the measures are generalized 
to nonnegative self-adjoint operators with norms ^1. A natural question arises. How 
do we realize such generalized measurements? Does every operator-valued measure 
correspond to some physical measuring process? In the sequel we shall prove the fol- 
lowing major theorem, which will be restated in more precise mathematical language 
in Section V. 

Theorem 1 

Every operator -valued measure can be realized as corresponding to some physical 
measurement on the quantum system in question in the following sense. 

(a) It can always be realized as a measurement corresponding to a self-adjoint oper- 
ator on a composite system formed by the system under observation and some adjoining 
system that we call the apparatus. 

(b) Under suitable conditions that will be specified later, it can be realized as a 
sequence of self-adjoint measurements on the system alone./ 

We shall give a simple example showing when an observable cannot provide the 
information that we desire and hence generalized measurements have to be used. 

Consider the situation in which the information to be transmitted is being stored in 
the orientation of the spin of an electron. The electron is in one of three possible states, 
just as those described in Example 3. A spin measurement performed on the electron 
(that is, a Stern-Gerlach experiment) can have only one of two possible outcomes. This 
measurement is clearly unacceptable for distinguishing among three possibilities, and 
it is necessary to bring in an apparatus to interact with the electron. The subsequent 
measurement on the composite system will give the desired outcome statistics. 
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IV. EXTENSION OF AN ARBITRARY OPERATOR- VALUED MEASURE 
TO A PROJECTOR -VALUED MEASURE ON AN EXTENDED SPACE 

We are now concerned with the proof of Theorem 1 and we provide two construction 
procedures for the extension space and extended projector-valued measure. For readers 
who are interested neither in the proof nor in the construction, this section may be 
skipped without inhibiting understanding the rest of the report. Reading Example 5, how- 
ever, may be very instructive. 

In order to prove Theorem 1 we need some preliminary mathematical results. First, 
we want to investigate the extension of an arbitrary operator- valued measure to a 
projector-valued measure on an extended space. Two slightly different methods of exten- 

Sion will be offered, since each has its own merits. 

Holevo^ has noted that Naimark’s theorem provides such an extension. 

Theorem 2 (Naimark's Theorem ) 

Let F be an arbitrary resolution of the identity for the space 3C. Then there exists 
a Hilbert V^ce JC*" containing K as a subspace, and there exists an orthogonal resolu- 
tion of the identity eJ" for the space JC+, such that F^f = all f e 3C, where 

is the projection operator into 3C. / 

The proof, which provides an actual construction, is given in Appendix C. 

The second method of extension is related to the unitary representations of 

semigroups. 

DEFINITION 3. Let G be a group. A function T(s) on G whose values are bounded 
operators on a Hilbert space 3C is called positive semidefinite if T(s ) = T(s) , for 
every s S G and 

2 ^ {'T(^~^s)h(s).h(t)} 2=0 (2'^’ 

s£G tec 

for every finitely nonzero function h{s) from G to 3C {that is, h(s) has values different 
from zero only on a finite subset of G)./ 

DEFINITION 4. A unitary representation of the group G is a function U(s) on G, 
whose values arc unitary operators on a Hilbert space 3C, which satisfies the conditions 
U(e) = I (e being the identity element of G), and U(s)U(t) = U(st), for s, t £ G./ 

The following theorem is due to Sz.-Nagy. 

Theorem 3 

(a) If U{s) is a unitary representation of the group G in the Hilbert space 3C , and 
if 5C is a subspace of JC + . then T(s) = P;^L-{s)/X is a positive-definite function on C. such 
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that T(e) = Moreover, if G has a topology and U(s) is a continuous function of s 

(weakly or strongly, which amounts to the same thing because U(s) is unitary), then T(s) 
is also a continuous function of s. 

(b) Conversely, for every positive -definite function T(s) on G. whose values are 
operators on 3C, with T(e) = there exists a unitary representation of G on a space 
containing 3C as a subspace such that 
T(s) = p3(,U(s)/3C for s S G, 

and the minimality condition for the smallest possible 3C is iven by 


= V U(s) 3C 
sGG 


(minimality condition). 


(39) 


This unitary representation of G is determined by the function T(s) up to an isomorphism 
so that it is called "the minimal unitary dilation" of the function T(s). Moreover, if the 
group G has a topology and T(s) is a (weakly) con.muous function of s, then U(s) is also 

a (weakly, hence also strongly) continuous function of s./ 

[Notes. In (a) the solidus indicates that the operator is restricted to operation on 
elements in 3C. In (b) U(s)3C means the ,'^et of all elements U(s)f. ^f e V .Y/ j 

is defined as the least subspace containing the family of subspaces An isomor- 

phism between two normed linear spaces 3C^ and 3C^ is a one-to-one continuous lin- 
ear map M : 3C^ — 3C^ with 

The proof, which also provides a construction, is given in Appendix D. 

Given Theorem 3, we arrive at the following theorem for the extension of arbitrary 
operator -valued measures. 


Theorem 4 

Let {f [ be an operator-valued measure on the interval 0 X 277, then tliere exists 
a projector-valued measure {e^} in some extended space c 3C s uch that 
for all \ . / 

The proof is given in Appendix E. 

Note that the minimality condition of Theorem 3 

= V U(n) 3C 
n=0 

is equivalent to 

5C^ = VE.3C. 

X ^ 

and the system (3C, JC+,{eJ) is determined up to an isomorphism. Also, the interval 
of variation of the parameter X, (0,277) can be extended to any finite or infinite interval 
by using a continuous monotonic transformation of the parameter X. 
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EXAMPLE 5 (see Chan^'^) 


In Example 3 we gave an operator-valued measure that is not a projector-valued 
measure. Three vectors have the structure shown in Fig. 1. We define 

Qi =-| Ui) <Si| , i = 1,2. 3. (42) 

Then 

3 

S Q. = I,.. (43) 

i=l ' ^ 

where denotes the identity operator of the two-dimensional Hilbert space 3C spanned 
by the three vectors {| Sj)}?_j. Pick any extra dimension orthogonal to 3C to form <1C^ 

together with 3C. Let be an orthonormal basis for the three-dimensional space 

3C as shown in Fig. 2, By symmetry considerations, we adjust the axis of the 



Figure 2. Configurations of H. = | <j). ) ( <j>. I . 

coordinate system made up of the to be perpendicular to the plane 3C spanned 

by the {| s. ) K The projections of the | <t>. ) on the plane of the 1 s^ ) along the axis are 
adjusted so that they coincide with their respective 1 s. ). so that j{4). 1 s^^) | = constant 
for all i is maximized (see Fig. 2). By straightforward geometric calculations, 

|<<(,Js.)|^ =1 (44) 

and 

(45) 

Hence 

PjcUi) (<f>i|p3c “T ^®i^ ^®i' " 
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V. FIRST REALIZATION OF GENERALIZED MEASUREMENTS: 

FORMING A COMPOSITE SYSTEM WITH AN APPARATUS 

Given Theorems Z and 4, we can prove immediately part (a) of Theorem I . Firsti 
we must define some mathematical quantities in order to state the theorem more pre- 
cisely. We follow the procedure suggested by Holevo,^ although he did not give a detailed 
development. 

We combine two systems, say S and A, to form a composite system and if 3Cg and 
3C^ are the Hilbert spaces that previously describe their individual states, then the joint 
state of S+A can be described by the Tensor Product Hilbert Space 3Cg ® formed by 
the tensor product of the two spaces 3Cg and Thus if the state of S is |s) and 

the state of A is | a), in the absence of any interaction between S and A the joint state r 

of S+A is denoted by | s)| a) (Dirac notation is used for states). Moreover, every ele- 
ment in 0 3C . is of the form S c. | s. ) | a. ). where the c. are complex numbers such 
.A. i ^ ^ ^ ^ 

that S 1 cj ^ < 00. and the 1 s.) and the 1 a^) are elements in 3Cg and 3C^. respectively. 

The inner product on JCg 0 3C^ is induced in a unique way by the inner products on 
the constituent spaces 3Cg and so that 

(<ajt<sJ.ls2>|a2>) = <Sj|s2> (aja^). (48) 

It is an immediate consequence of this structure that if we have a set of complete 
orthonormal basis for each of the two spaces 3fCg and then the set of tensor products 
of the elements in these two sets, taken two at a time, one from each set. forms a com- 
plete orthonormal basis for 3Cg 0 That is. if {| and {| ^ 

of complete orthonormal basis for 3Cg and 3C^, then the set {| s. ) | a^)}.^.^^ j forms 

a complete orthonormal basis for the space Kg 0 the eleinents of which cannot be 
separated into the tensor product of an element in 3Cg and an element in but it is 
possible to express every element in 3Cg 0 as a linear combination of elements that 
are separable. 

Given this definition of the space Kg 0 3C^. the operators in this space can be defined 
easily. If Tg and T^ are bounded linear operators in Kg and K^, then there is a unique 
bounded linear operator Tg 0 T^ in Kg 0 K^ with the property that 

(Tg 0 T^)(|s)la>) = (Tgls)) • (T^|a>) (49) 

for all I s> e Kg and all | a> e K^, 

Tg 0 T^ is called the tensor product of the operators Tg and T^. Thus if the state 
of S is described by the density operator pg and the state of A by p^. we can show that 
in the absence of interactions the joint state is given by the operator Pg 0 Pa- By lin- 
earity. the operation of the operator Tg 0 T^ can be extended to arbitrary elements 
in Kg 0 K^. Again, the most general operator on Kg 0 K^ cannot be written in the 
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form of the teneor product of two operators as above, but can be expressed as a linear 
combination of such product operators, and linearity defines the operations uniquely on 
elements in Kg ® 3C^. 

It is obvious that this description may be extended to describe a composite system 
with arbitrarily many {but finite) component systems, instead of two. 

For the moment, this concludes the characterization of composite quantum systems. 
We shall discuss the dynamics of such systems when we talk about interactions 
(Sec. XVII). 

Now we are able to state part (a) of Theorem 1 more precisely. 


Theorem 1 

(a) Given an arbitrary operator-valued measure where A is one index 

set on which the measure is defined, we can always find an apparatus with a Hilbert 
space 3C^, a density operator p^, and a projector- valued measure corre- 

sponding to some self-adjoint operator O = S q„II„ on 5C„ ® K . such that the proba- 

c(EA ^ 

bility of getting a certain value q^ corresponding to as the outcome of the 
measurement is given by 

P(q.) = Trg {pgQj 

” '^s+At'’s ® PAnJ' 


for all density operators Pg in 3fCg, where Trg is the trace over Kg and TVg . is the 
trace over Kg ® K^. / 

[Note. The trace of an ope”ator D over a space K is defined as Tr{D} = S ( f J Djf. ), 

where {jf|)} is any complete orthonormal basis of K. This quantity is independent 
of the particular choice of basis.] 


Proof; We know from Theorems 2 and 4 that an arbitrary operator-valued measure 
{q . with operator -values on the space K_ can be extended to a projector-valued 
measure with operator-values on an extended space K^ that contains Kg as a 

subspace. K"*" can be embedded in a tensor product space Kg ® K^ for some apparatus 
Hilbert space with enough dimensions. The question of how many dimensions are 
required will be addressed later. For the moment, assume that K^ has enough dimen* 
sions that the dimensionality of the space Kg ® K^ is greater than or equal to that of 
K^. If the state of the apparatus is set initially at some pure state |a), then the joint 
state of S+A can be described as the tensor product pg ® |a) (a| of a density operator p 
in Kg, and the density operator = |a) ^ ^A’ every element | s) in Kg 

it can be identified as the element | s)|a) in Kg ® K^. And the whole space Kg can 
be identified as the space Kg ® where is the one -dimensional subspace 

of K^ spanned by the element ja). Now K = Kg ® ^ proper subspace of 
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® ^A’ projection operator into the subspace 3C can be identified as Pjg = 

Ijg ® [ a) ( a| , where the set {] Sj>} is any orthonormal basis in 3Cg. We can form an 

operator-valued measure {q^ ® |a> with values in the space 3C. By Theorems 2 

and 4, there exists a projector-valued measure on an extended space -'JC'*' that 

We can take as 3Cg ® since we have assumed that has enough dimensions! so 
that 

Qg ® !a)<a| = Pjgll^Pjg, e A, ( 51 ) 

Now for an arbitrary density operator Pg in 3Cg, 

= '^*’S+A^<PS ® U> ( HQa ® ! a>< 

= Trg+A^<Ps ® l^> (52) 

With the relation Tr {bc} - Tr {cb}, 

= Trs^Afr3c'l>s ® > < “I > Vj- (53) 

But Pg ® |a) (a| is an operator in 3C. Hence 

“ Ps ® ' (54) 

Therefore 

Tr {pgQ^} = Trg^^{pg 0 |a) <a|n^}, (55) 

for any arbitrary density operator pg. Note that 
Q^ = <a|(Q^® |a>(aj)|a) 

= Tr^{(Q^® |a>(a|)(L^ 0 |a><a|)} 

s 

= Tr^{Pj|,n^ jKj} 

= ’■■■*{<'»: ® Ux^linJ 

s 

= "^^A^^^Kg ® Pa> V’ (56) 

where Tr^ denotes parUal trace over the space 3C^. The partial trace of an operator D 
in 3Cg ® over the apparatus Hilbert space 3C^ is defined as the operation 


where {| s^)}> {|a^)} are complete orthonormal bases in 3Cg and respectively. 
EXAMPLE 6 

We shall make use of the operator-valued measure described in Examples 1 and 5. 

In Example 5 we have already found the projector-valued measure extension in 

the three-dimensional extended space 3C^, If we consider the original two-dimensional 
Hilbert space 3C as the system space Kg, we only have to find an apparatus whose state 
is described by a Hilbert space 3C^, and then embed K"*" in the tensor product Hilbert 
space 3Cg ® Any apparatus Hilbert space of dimensionality >2 will work (dimen- 

sionality of 3Cg ® 3C^ will be > 4). Let = | a) { a| , where 1 a) is some pure state in 
Therefore the three possible joint states of S+A are {| Sj)| ^^hey span 

a two-dimensional subspace in Kg ® namely. Kg ® | where is the 

Bubspace spanned by |a). Choose any other one- dimensional subspace 

3Cg ® K^ orthogonal to Kg ® Then the space Kg ® V (=K ) is 

three-dimensional and includes Kg ® -^|a) ® subspace. Hence three orthog- 
onal projectors can be found in K'*', so that they are the extensions of the cor- 

responding operator -valued measures {QjIj-j (see Example 5 for the structure of the II.) 
Let I^ be the identity operator of the space Kg ® K^ - {Kg ® V 


n; = n. ® i , 

i 1 d 


for i = 1,2,3. 


(57) 


Then 


Z. "i " ^Kg® K^ 


(58) 


and 


Tr^{(IjC ® U><a|)II'J= Tr^{(Ij^, ® |a)<al)n.} 
S S 

= Qj, for i = 1, 2, 3. / 


(59) 
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VI. PROPERTIES OF THE EXTENDED SPACE AND THE 
RESULTING PROJECTOR- VALUED MEASURE 


We shall now examine the properties of the extended Hilbert space and the resulting 
projector- valued measure. The most important property is the dimensionality of the 
extended space, and it is important for two reasons. First, it will tell us the required 
minimum number of dimensions of the apparatus Hilbert space. In a communications 
context, the apparatus should be considered as a part of the receiver. If the dimension- 
ality of the extended space is known, we have some idea of the complexity of the 
receiver. Second, the analysis of the minimum dimensionality of the extended spa' (> 
is absolutely necessary for the discussion of the realization of generalized measurements 
by sequential techniques in Section X. ^ 

When little is known of the properties of the operator-valued measure, Theorem 4 
is powerful. It provides an upper bound for the dimensionality of the extended space 
whenever the cardinality of the index set, on which the measure is defined, is given. 

For example, in the M-ary detection problem, we try to decide on one of M different 
signals. The characterization of that receiver is given by an operator- valued measure 
defined on an index set with M elements corresponding to the M possible outcomes of 

the decision process. That is, we have M different 'measurement operators* 

M 

that form a resolution of the identity S = I. If the density operator of the message- 
carrying field is p, the probability of choosing the k^^^ message is Tr {pQj^j. The detailed 
properties of the optimum depend heavily on the states of the received field and the 
performance criterion that is chosen. Without going into a more detailed analysis of 
the communication problem, all that we know about the quantum measurement for an 
M-ary detection problem is that it is characterized by M 'measurement operators' 

{q.}. , . Theorem 5 is useful for this kind of situation. 

11—1 


Theorem 6 

M 

For an arbitrary operator-valued measure {Q.}. , , 2 Q. = I, whose index set has 

i=l ' 

a finite cardinality M, the dimensionality of the minimal extended Hilbert space, min 3^ , 
is less than or equal to M times the dimensionality of the Hilbert space JC. That is, 

dim {min JC^} <M dim (60) 

The proof of Theorem 5 is given in Appendix F. 

We shall show eventually that there exists a general class of {Qj} such that the upper 
bound is actually achieved. In the absence of further assumptions on the structures of 
the Q., this is the tightest upper bound. 

If more structures for the operators Q. are given, we can determine exactly how 
large the extension space has to be. Theorems 6 and 7 provide us with that knowledge. 


t 


i 


20 



Theorem 6 


If the operator-valued measure has the property that every is propor- 

tional to a corresponding projection operator that projects into a one-dimensional sub- 
space of 3C (l.e.» Qfl = % ! Qq ) ( I » where 1 > > 0, and |q^) is a vector with 

unit norm), then the minimal extended space has dimensionality equal to the cardinality 
of the index set A (card {a}). That is, 


dim {min JC'*'} = card {a}. / 

The proof of Theorem 6 is given in Appendix G. 


( 61 ) 


Theorem 7 


Given an operator-valued measure let <R{q^} denote the range space of 

{Q„}i a e a. Then 


dim {min = S dim {61{Q }}. / 
a€A ® 

The proof is given in Appendix H. 

Given Theorems 6 and 7, we can make some interesting observations. 


(62) 


COROLLARY 1 . It is an immediate consequence of the proof of Theorem 7 (see 

Appendix H) that the statistics of the outcomes of measurements characterized by some 

operator-valued measure can be obtained as the 'coarse -grain' statistics of 

the outcomes of a measurement characterized by a set of one-dimensional operator- 

K ~ 

valued measures {p“ = q“ I q?> <q“l I “ (see Kennedy^ ^). By considering the 

I f G L 

associated set of one-dimensional operator-valued measures instead of {q^} no 
additional complications will be introduced, since the minimal extensions of the two sets 

r 1 f 

of measures are exactly the same. In this sense the two sets {Q } ^ . and { pf “ 

I. kj, _ , _ . 

are 'equivalent'./ K-i,aeA 

COROLLARY 2. If all of the operators are invertible (that is, if each of their 
ranges is the whole space 3C), then 

dim {min = card {a} * dim {3C}./ (£, 3 ) 

The proof is obvious with Theorem 7 . 

Note that the upper bound of Theorem 5 is exactly achieved when all the Q are 
invertible. 


COROLLARY 3. The construction of the projector- valued measure and the extended 
space provided by NaTmark's theorem (Theorem 2) is always the minimal extension. / 
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The proof is given in Appendix I, 

EXAMPLE 7 

In Example 5, the operator- valued measure {q. | has the property 

that each operator is proportional to a one -dimensional projector. Hence, either by 
Theorem 6 or Theorem 7 , the dimensionality of the minimal extended space should be 
equal to the cardinality of the index set which is three. Therefore the extension given 
in Example 5 is minimal. It is clear from Example 5 that the projector-valued extension 
has to be defined at least on a three-dimensional space./ 

DISCUSSION. Theorems 5, 6, and 7 hold when the dimensionality of f.ie Hilbert 
space 3C is countably infinite (=k^). but we must be careful in interpreting the results. 
The following rules are useful for cardinality multiplication: 

Finite cardinality is indicated by an integer. 

Countably infinite cardinality is indicated by k^. 

Uncountably infinite (or continuum) cardinality is indicated by k ^ 

integer • integer = integer, 

integer • « = k , 
o o 

^ integer _ ^ 

^ o o’ 



In Theorem 5, the dimensionality of the minimal extended space min is given by 
dim {min (fC^} dim {3C}. Thus if dim {3C} = k , then dim {min 5C^} = M • k = k also. 
This does not mean min 3fC = 3C. If we examine the proof of that theorem closely, the 
minimality statement really means 

dim {min SFC"'' -3C} = k^. (64) 

The Jieason is that with the space 3C we need (M-1) dim {3C} = (M-I)k^ = number of 
dimensions for the extension. (This holds even if M goes to infinity because ’ k = /c .) 

This is also true for the result of Theorem 6 which states dim {min 3C'*’} = card {a}. 
In the event that card {a} = k^, the result should be interpreted very carefully. Let A' 
be a subset of the index set A such that for all a e A’, 1 ^ • This means for all the 

a e A-A', = I i- -d is already a projector that requires no extension. Hence all 

the 'extra' dimensions required in rain SC'*’ are for those with a S A'. Thus we have 
the following Interpretation of the result of Theorem 6 . 

dim {min ?C^-3C} = card {a'} - dim S (65) 
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where 'I indicates the range space of the operator in braces. Obviously card {a} 
can be finite or infinite. Accordingly the 'extra* dimensions needed to form min 5C 
from 3C are finite or infinite. 

Similar interpretations should be made for the result of Theorem 7 . In Corollary 1 
we note that the extension in Theorem 7 is structurally similar to that in Theorem 6, so 
the same interpretation applies. If we follow the proof of Theorem 7» we arrive at the 
following result (which we shall not derive in detail) . 


dim {min - 3fC} =5 S dim f <R{liiii (Q 

a€.A I n-oo J 

lim(Q^-Q“)}]., 
n— 00 J 


- dim 


((fi{ S 

I aeA 


( 66 ) 


The result for Theorem 6 is a special case of this one. / 
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VII. APPARATUS HILBERT SPACE DIMENSIONALITY 


We are now in a position to make some general comments about the complexity of 
the apparatus at the receiver of a quantum communication system. Bear in mind that 
the dimensionality of a tensor product Hilbert space 3Cg ® 5C^ is given by 

dim {3Cg 0 3C^} = dim {JCg} * dim (67) 

We may state the following theorem for the minimum dimensionality of the apparatus 
Hilbert space. 

Theorem 8 

If the system Hilbert space 3Cg is extended first to the space SC"*" D IfCg and 3C^ is a 
minimal extension, then the minimum number of dimensions of the apparatus Hilbert 
space required for a realization of the measurement described in the sense of 
part (a) of Theorem 1 is given by the smallest cardinal N such that 

N • dim {5Cg} ^ dim {min 3C"*"}. / (68) 

The proof is obvious. 

In the absence of detailed knowledge of the nature of the operator -valued measure. 
Theorem 5 gives the following theorem. 

Theorem 9 

For an arbitrary operator-valued measure S = 1^, whose index set has 

a finite cardinality M. the minimal dimensionality of the apparatus Hilbert space 3C^ 
required to guarantee an extension of the measure to a projector-valued measure in the 
tensor product space 3Cg 0 3C^, is equal to M./ 

Proof ; The inequality in Theorem 5 asserts 
dim {min dim {3Cg|. 

If we make dim {^^1 = 

dim {J5Cg0 = dim {3Cg| • dim {^C^} 

= M dim {3Cg| > dim {min 3C^}. (69) 

Hence we can always guarantee an extension. Since we show in Corollary 2 that the 
bound can be achieved for some classes of measures. M is the minimum dimensionality 
that will always guarantee an extension. / 

The implications of the theorem are very interesting. One of the main reasons for 
our investigation of measurements characterized by generalized operator-valued 
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measures is that we hope to improve receiver performances by optimizing over an 
extended class of measurements that are not completely characterized by self-adjoint 
operators. Theorem 5 tells us that if we are interested in the M-ary detection problem, 
all we have to do is to adjoin an apparatus with an M -dimensional Hilbert space and 
consider only measurements characterized by self-adjoint operators in the tensor prod- 
uct Hilbert space 3Cg <8> 3C^. 

The following theorems are immediate consequences of Theorems 6, 7, and 8. 
Theorem 10 

If the operator-valued measure has the property that every is propor- 

tional to a corresponding projection operator that projects into a one -dimensional sub- 
space of 3C (that is, |q^ ) 1 1 where I 0, and jq^) is a vector with 

unit norm), then the minimum number of dimensions of the apparatus Hilbert space 
required for a realization of the measurement described in the sense of part (a) of The- 
orem 1 is given by the smallest cardinal N such that 

N dim {3fCg} 5 card {a}. / (70) 

Theorem 1 1 

Given an operator- valued measure {Q^j}g,2A’ denote the range space of Q^, 

a e A. Then the minimum number of dimensions of the apparatus Hilbert space required 
for a realization of the measurement described in the sense of part (a) of Theorem I is 
given by the smallest cardinal N such that 

Ndim{3Cj& S dim{(R{Q}}./ (71) 

^ aSA ® 

The proof is obvious. 

EXAMPLE 8 

In Example 6 we showed how the extended space in Example 5 can be embedded in 
a tensor product Hilbert space of 3Cg and an apparatus Hilbert space We noted that 

the space must be two-dimensional or bigger. The results in this section confirm 
that the dimensionality for 3C^ must be at least two. 

DISCUSSION. We must be careftil in interpreting the results of this section when 
the dimensionality of the Hilbert space 3Cg is infinite. 

In Theorem 8 when both dim {3C_} = dim {min SC*"} = k (countably infinite), the dimen- 

2 d o 

sionality of the apparatus space will be an integer (in fact, either 1 or 2). It will be 1 
when the measure is already projector- valued and does not need an extension; it will 
be 2 when the measure is not a projector-valued measure. Hence, if the Hilbert space 
in Theorem 9 is infinite dimensional (k^), the minimal extended space is also infinite 
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dimensional (M • k = k ). The 'extra' dimensionality required for the most general 
measure is at most°(M-l) * Hence, if the apparatus space is two-dimensional, 

we can guarantee an extension of any measure on the tensor product space 3Cg 0 3C^, 
For Theorems 10 and 1 1, if both dim {3Cg} = dim {3C+} = then the dimensionality 

of the apparatus space required is two./ 
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VIII. SEQUENTIAL MEASUREMENTS 


We shall now discuss the second realization of generalized quantum measurements 
as stated in part (b) of Theorem 1 . Our interests in sequential measurements originate 
from the investigations of the interaction of a system un<ier observation with an appara- 
tus. and sequential measurements being performed separately on the system and appa- 
ratus. with the structure of the second measurement optimized depending on the outcome 
of the first measurement. In order to illustrate how a sequential measurement may 
actually be performed, we give an example of a simple binary detection problem. (See 
Appendix J for a more general problem.) We also analyze sequential measurements 
more mathematically. 

8. 1 Sequential Detection of Signals Transmitted by a 
Quantum System (see Chan^^) 

Suppose we want to transmit a binary signal with a quantum system S that is not cor- 
rupted by noise. The system is in state | s^) when digit zero is sent, and in state js.> 
when the digit one is sent. (Let p^ and Pj be the a priori probabilities that the digits 
zero and one are sent, p^ + p^ = 1 .) The task is to observe the system S and decide 
whether a "0” or a "1" is sent. The performance of detection is given by the probability 
of error. Helstrom^^ has solved this problem for a single observation of the system S 
that can be characterized by a self-adjoint operator. The probability of error obtained 
for one simple measurement is 


Pr[e] = |fl' / 1 -4p^p^|(Si|s^>T 


(72) 


We try to consider the performance of a sequential detection scheme by bringing an 
apparatus A to interact with the system S and then performing a measurement on S 
and subsequently on A. or vice versa. The structure of the second measurement is 
optimized a s a consequence of the outcome of the first measurement . 

Suppose we can find an apparatus A that can interact with the system S so that after 
the interaction different states of system S will induce different states of system A. 
Suppose the initial state of the apparatus is known to be |a^), and the final state is |a^) 
if S is in state Is^). and |a^> if S is in state |s^>, and \a\)* la^>. It is shown in 
Part II of this report that the inner product of the state that describes the system S+A 
when digit ”0” is sent and that which describes it when digit "1" is sent is invariant under 
any interaction that can be described by an interaction Hamiltonian H^g that is self- 
adjoint. That is. 




(73) 


where s^ ) and Is^) are final states of S after interaction if a "0"'or a "I” is sent. 

O' 1 
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Now suppose 


l(s^Is^>| < |<s^|s^)| < I (74) 

which implies also 

|<s^lsj)l < l<a^U5>| < 1. (75) 

We wish to observe S first in an optimal way. The process is similar to Helstrom's 
in that we choose a measurement that is characterized by a self-adjoint operator Og in 
the Hilbert space 3Cg so that the probability of error Pr[€g] is minimized, and it is given 
by 



and the probability of correct detection is 



Suppose the outcome is "1". The a priori probabilities p, ,p of apparatus A for states 
I f f f i O 

I a^ ) and | a^) have been updated to Pr[Cg] and Pr[eg], respectively. 

Now we perform a similar second measurement on A, characterized by an operator 
in the Hilbert space 3C^. A new set of a priori probabilities Pj = Pr[Cg], p^ = 
is used for the states j a j ) and |a^). Assuming that we already have all available infor- 
mation from the outcome of the first measurement in the updated a priori probabilities 
for A, we base our decision entirely on the second measurement. The optimal self- 
adjoint operator is chosen to minimize the probability of error of detection Pr[c] in 
a process similar to the first measurement, and the performance is 

Pr[e]=-|-[l - J\ -4Pr[Cg]Pr[€g]|<a^|af>|2 . 

But Pr[Cg] Pr[Cg] = P^P^K j s^) |^, and < s ^ | s^) ( a^ | a^) = < s J s^>, which gives 

- y^PiPoK®iNo>l^} 

This is exactly the same performance obtained by Helstrom in one simple measurement. 
When the first measurement characterized by the operator Og is performed and one of 
two outcomes will result, we decide (temporarily) that either the digit "0”or the digit "1" 
is sent. Since Og is a self-adjoint operator, it possesses an orthogonal resolution of 
the identity (and hence defines a projector -valued measure on the digits "0” and "1"). 

Let be the corresponding projector-valued measure for the outcome "0". Then 
is the measure for the outcome "1". The probability of getting outcome "0" is P = 

{s| n^j s), where | s) is the final state of S (either [ s^) or | s^)), and the probability 
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of getting outcome "I" is 1-P. We represent this first measurement diagramatically 
in Fig. 3 by a tree with two branches. The transition probabilities are given by P for 



the branch zero, "0", and 1-P for the branch one, "1". If the outcome is "1", we shall 
perform a second measurement on A characterized by the self-adjoint operator O . 
Associated with are the projector-valued measure II ^ and I-II ^ , for outcome " 1 " 
and "0", respectively. If, however, the first outcome is "0", we perform a different 
measurement corresponding to O]^, with associated projector-valued measures n and 
I-n^ for "I" and "0", respectively. and do not have to commute; in fact, they 
do not for the optimum detection scheme (which minimizes the probability of error) in 
this example. Both measurements are represented diagramatically in Fig. 4. The 



decide on "0" 
decide on "l '* 


probabilities of the different outcome sequences are 
Pr{''0", "0"} = (<s|n^|s))(l-<a|n 2 |a>) 

= <a|<s|n^0 (I-n2)is)|a> 

Pr{'-0".«1"}= (<s|n^|s>)«a|n 2 |a>) 

= <a|<s|n^® n 2 |s)[a) 
Pr{'-1»,"0"}= (l-<sln^!s))(l-<alnj[a>) 

= (al<s|(I-n^)(I-nj)|s>la> 


(78) 


(79) 


( 80 ) 
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Pr{«l","l"}= (l-<s|n^|s>)«a|nj|a)} 

= <al<s| (I-n^) ® nj s)| a>. 

When the last outcome is *’0*' the receiver will decide that "0" ("1") was sent. 

It is surprising that an optimum measurement for the binary detection problem can 
be realized as a sequential measurement. Appendix J gives another realization for the 
optimum measurement for a more general binary detection problem. Naturally, we are 
interested in characterizing the general class of measurements that can be provided by 
sequential measurements . 


8.2 Projection Postulate of Quantum Measurements 

In order to characterize sequential measurements, it is necessary to characterize 

the behavior of a quantum system after a measurement has been performed on it. 

Von Neumann has provided a rather mathematical and concise yet complete characteri- 
18 

zation. We shall summarize only the essentials for characterizing sequential mea- 
surements. 

When a measurement corresponding to a self-adjoint operator A is performed on 
a quantum system S. the outcome of the measurement will be one of the eigenvalues of 
the operator A. and the resulting state of the system S will lie in the eigenspace cor- 
responding to that eigenvalue. More precisely, let {p^}^j be the orthogonal resolution 
of the identity given by A, such that 

M 

S P. = I 
i=l ^ 

and (82) 


M 

A = Z a..P... 

i=l 


1 1 


where each a^ is a real eigenvalue of A corresponding to the projector P.. The proba- 
bility of getting the eigenvalue a^ as the outcome is 

P(a.) = <s|Pjis> (83) 

if S is in the pure stale Is), or 

P(a.) = Tr{pP.} (84) 


if S is a statistical mixture described by the density operator p. 

Given that the outcome is the value a^, the postulate states that the system will be 
left in the state | s'): 



(85) 
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if S is in the pure state | s). The factor (s|p. in the denominator is fr.r normal- 

iisation. If S is described by the density operator p, it will be left in the state described 
by the density operator 


P’ 


PjPPi ^ 

Tr{p.p}’ 


(86) 


where the factor Tr{p.p} is for normalization./ 

Julian Schwinger gives a more general statement on the Projection Postulate.^^ He 
asserts; Given that the eigenvalue a^ is the outcome, the system can result in a state 
that is not entirely in the eigenspace corresponding to the projector P^. This does not 
contradict the view of von Neumann. If a transformation characterized by a unitary 
operator which is due to an interaction with some other quantum system is allowed 
after the measurement has been performed, the system can result in a state that does 
not lie in the eigenspace into which Pj projects. In this sense the von Neumann postulate 
can adequately take care of all physically possible situations. The Schwinger formula- 
tion does not add new dimensions to our problem, and we shall not give a precise state- 
ment of his views here, nor prove its equivalence to von Neumann's views. 


8. 3 Mathematical Characterization of Sequential Measurements 

In this section we shall characterize sequential measurements mathematically in 
terms of the statistics of the outcomes of the measuring process. The basic concept in 
the characterization is simple, given the projection postulate of von Neumann, although 
the mathematics for the most general characterization sometimes seems very compli- 
cated and formidable. P. A. Benioff has recently written three papers^"^ on the detailed 
characterization of each sequential measurement. That characterization is too compli- 
cated and involved for our purposes. We shall ouliine a simple characterization based 
on von Neumann's projection postulate. For our areas of concern, in effect it will have 
all of the generality of Benioff s characterization. 

It is important to note that the type of sequential measurements we are considering 
involves a decision procedure at each step of the measurement. To start the measuring 
process, a measurement corresponding to a self-adjoint operator is performed. Then, 
depending on the outcome of the first measurement, a decision is made about what the 
second measurement should be. The form of each subsequent measurement is decided 
on the knowledge of the outcome of each previous measurement. The decision proce- 
dures can be predetermined. That is, before the start of the measuring process we can 
prescribe the measurements that should be performed contingent on the various possible 
outcomes. This enables us to represent the measuring process in the form of a tree as 
in Fig. 4. 

Figure 5 is an example of a typical tree. Each vertex is labeled by a let- 
ter with numerical subscript (for example, c^). At each vertex (with the exception 
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of the terminal vertices such as c^ and dj) 
a measurement corresponding to a self- 
adjoint operator is performed. English 
letters are used to label the chronologi- 
cal order of the various measurements 
performed in the process. Thus the mea- 
surement at any vertex labeled by the 
alphabet *c' follows the measurement at 
a vertex labeled b, and the measuring 
process evolves chronologically from 
left to right in the manner in which the 
tree is drawn. 

Let the self-adjoint operator corre- 
sponding to the measurement at an arbi- 
trary vertex Oj {where a is an alphabet, i an integer) be labeled as . Without 

loss of generality, the number of different outcomes of each measurement is assumed 
to be finite (the infinite case will be considered later), so that at each vertex the forward 
progress of the tree representing all possible outcomes of the measurement is described 
by a finite number of branches. When the measurement at a vertex, say is per- 
formed, one of several outcomes may result with certain probabilities, and they are 
represented by all of the vertices on the right of the vertex o. that are directly connected 
to it (by directly we mean that the connection does not go through any other vertex or 
vertices). Each of these vertices labels an outcome. For example, the measurement 
at vertex b^ in Fig. 5 has three possible outcomes, c^, c^ and c^- The self-adjoint 
operator corresponding to the vertex o. defines a projector- valued measure on the 


Figure 5 


set of all possible outcomes that is labeled by the corresponding vertices. If the vertices 

!. 

3 


are p.. j = N ,N 


c. 

1 


a. + l 
1 


M , ,M , where N are both integers, let the projector- 

o . o . a. a. 


^ " M ^ 

a. 

valued measures be {Pg f • Of course. 

1 


lU 


a. 

1 


j=N, 


Pp = I, the identity operator 


and 


a. 

1 


i 




S 

j=N 


(87) 


a . 

1 


^P .^p,’ 


where the \„ are the distinct real eigenvalues of the operator . 
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When the sequential measuring process takes placei the state of the system will fol- 
low a certain 'path' of the tree. At each measurement only one of several outcomes can 
occur; therefore, each of the possible paths the system may follow is well-ordered in 
the sense that all vertices in the path are connected in the chronological order of the 
English letters that label them. Each path starts at the initial vertex a^ and ends at a 
terminal vertex. Thus in Fig. 5 (a^,bj^i c^, dg) is a path and (aQ,bj, Cg) is not. We use 
the labels of the vertices of a path to label the path. Since different measurements can 
be performed at different vertices, the sequential measuring process may be said to 
involve a decision procedure. The operators can be predetermined, but a measure - 

ment corresponding to one O is chosen, depending on the previous outcome which is 

“i 

probabilistic. In order to characterize this sequential process, we must specify the 
statistics of the outcomes. Specifically, if the system is in some initial state, we want 
to know the probability of it following a certain path. A straightforward application of 
von Neumann's projection postulate provides the answer. 

Let the system be in the pure state | s) originally. We will determine the probability 
of it following the path, say (a^, b^, c^, d|^, . . . , Pj^), where i. j, k, S. are some integers and 
Pj is the terminal vertex. When the measurement is performed, the probability of 

the system branching to the vertex b, is (s|p. |s), where P, is the projector -valued 

1 D. Dj 

measure of the outcome b^. By the von Neumann projection postulate, when the out- 
come b. occurs the system is left in the state 



( 88 ) 


In general, given that the system is in the state 1 s' ) at a vertex Cj, the probability of 
branching to the vertex p. is ( s" | P„ Is"), and as a result the system will be left in 

K Pj. 

the state 


Pq U"> 
Pk 


<s'|P. |s"> 


«\l/2 


Hence the probability of following a path (a^. b., c^, dj^, . . . , p^) is given by 
Pr{a^,b., Cj.dj^ p^| |s)} = <s|p^^ |s) ( s(b^) | P^ | s(bj) ) 

<s(c.)|p . |s(c.)>... . 

.1 Ol, J 


( 89 ) 


For arbitrary vertices a , p with p immediately following a , 

n m m n 
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<8'!p^ |s'Xs'(a„)lP s’(c^)) 


n 


m 


= <S'|P^ I s’) 




P, Is') 

n 


n 


(s'|P„ Is') 


.^l/2 P 


m 


n 


<s'iP„ Is') 
n 


,vl/2 


’’s P. I“'>- 


n '^m n 


Therefore, by induction, 

Pr {eq, bj. c ., I I s )} = < s I Pb.Pc .^dj^ ' ' ' .^b. I ® ^ ’ 


(91) 


Define the operators 


R(a^. b^, c., d^ Pj) = Pb^Pc ^dj^ ■ • - Pp/ 


(92) 


and 

Q(a^, bj^, Cj, dj^, ...» p^) - R(aQ, b^» • • • » Pjj) R^(a^,b., . . • , p^^). (93) 

Then 

Pr{a^,b^, c^, ,..,pj |s} = Pr{path| |s)} 

= <s|Q(a^,bj. c^, . . .,Pjj)|s) 

= ( s I Q(path) I s ) . (94) 

It can be shown that Z Q(path) = I, the identity operator, and Q(path) ^0, for all 
all paths , 

paths. So the set of nonnegative -definite operators {Q(path)}^jj paths operator- 

valued measure for the set of all outcome paths of the sequential measurement. And 
the measures adequately characterize the statistical properties of the sequential mea- 
suring process. 

Note that we have discussed the case when the system is in a pure state. When it 
is described by a density operator, in general the mathematical arguments are essen- 
tially the same but the notation is more complicated. The derivation is omitted here. 
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EX. SOME PROPERTIES OF SEQUENTIAL MEASUREMENTS 

A sequential measurement does not correspond in general to a measurement char- 
acterized by a self-adjoint operator in the original Hilbert space of the system because 
the operator-valued measure for a path does not have to be a projector. An example is 
the sequential measurement represented by the tree in Fig. 6. 



Figure 6 


The operator-valued measures for the path (a^. b^i c^) is 




Q = Pu P Pk P., Pk • 

c c_^ b. 

o o o o o 

If p, and F' do not commute, 

O O 


(95) “ 

(96) 


* Q. 


(97) 


Hence Q is not a projector-valued measure, and the sequential measurement does not 
correspond to any single self-adjoint measurement on the system alone. 

Theorem 12 gives the necessary and sufficient condition that a sequential measure- 
ment must satisfy so that there is a single self-adjoint measurement on the system that 
would generate the same measr.rement statistics. 


Theorem 12 

A sequential measurement is equivalent to a single measurement characterized by 
a self-adjoint operator on the Hilbert space of the system if and only if the operator- 
valued measure of every path is a projection operator. / 


Proof: Since the measure of each path is projector-valiT'id, by the Theorem for the 
Orthogonal Family of Projections (see Appendix A), the measures are also orthogonal 
and thus form an orthogonal resolution of the identity that is the spectral family of some 
self-adjoint operator. Conversely, if the measure Qj^ of the outcome of a path H is not 
projector- valued, then it is not orthogonal to all measures of the other outcome paths. 
Hence the measurement does not correspond to that of a single self-adjoint operator./ 
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Corollaries 3 and 4 give two sufficient conditions that may be more useful. 


COROLLARY 4. A sequential measurement is equivalent to a single measurement 
characterized by a self-adjoint operator on the Hilbert space of the system H the pro- 
jectors {p } of all the vertices {a.} of each path pairwise commute. / 

1 

Note that two projectors from two different paths do not have to commute. 


Proof ; If the projectors for each path pairwise commute among themselves, then 
the operator-valued measure Q for each path can be written 


Q(ao< b., c^, 




bi c. 


.Pp. ...P^P, 
c. b. 



P 


c. 


3 



(98) 


and 

= Q. (99) 

Hence the measure Q for each path is a projector-valued measure and corresponds to 
the orthogonal resolution of the identity given by a self-adjoint operatci' defined on the 
Hilbert space of the system./ 


COROLLARY 5 . A sequential measurement is equivalent to a single measurement 
characterized by a self-adjoint operator on the Hilbert space of the system H the pro- 
jectors {p } of all of the vertices {a.} of the whole tree pairwise commute./ 
i 


Proof ; If all projectors in the tree pairwise commute, then the projectors of all 
vertices of each path pairwise commute. By Corollary 4 the theorem is true./ 

Note that in the examples of binary detection in section 8.2 and in Appendix J, the 
sequential measurements satisfy the conditions of Corollary 4 but not those of Corol- 
lary 5. 

Finally, we should be concerned about the number of individual measurements that 
is necessary in a sequential procedi're to realize certain measurements. Theorem 13 
is obvious but will be useful later. The proof is omitted. 


DEFINITION. The length of a tree is the maximum number of vertices that a single 
path of that tree connects exclusive of the terminal vertices. 


Theorem 13 

Any self-adjoint measurement with a finite number of outcomes M is equivalent to 
some sequential measurement characterized by a binary tree of length N, where N is 
the smallest integer such that 

M € 2 ^./ ' . ( 100 ) 
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X. SECOND REALIZATION OF GENERALIZED MEASUREMENTS: 
SEQUENTIAL MEASUREMENTS 


We have given an example of a two-stage sequential measurement characterized by 
a binary tree of length two (see Fig. 6). The resulting measurement is of a generalized 
form. That is, it is characterized by an operator-valued measure but not by a projector- 
v-^lued measure. We shall now characterize several classes of operator-valued mea- 
sures that can be realized by sequential measurements, and prove pairt (b) of Theorem 1 
for several classes. It is important to realize that not all operator-valued measures 
can be realized by sequential measurements. For example, the operator -valued mea- 
sure given in Example 3 cannot be realized by sequential measurements, since the 
Hilbert space that describes the possible state of that system is two-dimensional. Any 
nontrivial measurement must have at least two possible outcomes. If the operator -valued 
measure can be realized by a sequential measurement, the first nontrivial measurement 
of the sequence will leave the system in one of two known pure states, and subsequen'' 
measurements will correspond to randomized strategies and yield no new information 
on the original state of the system. It can be shown that such sequential measurement 
has a different performance from the operator-valued measure described in Example 3, 

In fact, the detection performance of that measure for the three equiprobable states 
^ 2 
{| ^ in Example 3 is given by the probability of correct detection Pr[c] = . 

whereas any sequential measurement has performance Pr[c] <’j. 


Theorem 14 

If an operator -valued measure is defined on a finite index set , with values 

as operators in a finite dimensional Hilbert space 3C, (dim {3C}= N), and the measures 
{Q.} pairwise commute, then it can always be realized by a sequential measurement 
characterized by a tree with self-adjoint measurements at each vertex. In particular, 
if M ^ N, the sequential measurement can be characterized by a tree of length two. In 
general, the minimum length of the tree required is the smallest integer i such that 


f ^ 1 + 


log M 
log N • f 


( 101 ) 


NOTE. For a source with alphabet size A and output rate R, the number of output 

R,T 

messages in a duration of T seconds is M = A . Hence, for block detection of M 

signals generated in a duration of T seconds the required number of steps £ is 


£ 


log M 
^ log N 


= 1 + RT 


log A 
log N' 


( 102 ) 


For large T, 
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£oc T. 

Therefore the average number of measurements to be performed per second, 
constant for large T, and 


(103) 
£/T, is 


1 log A 

T " ^ log N • 


(104) 


If the dimension of the Hilbert space N changes with time, these expressions still hold 
by replacing N = N(T). For N(T) = DT, where D is a constant, 


1 . log A 

T ^ ^ log D + log T ’ 
and for large T, 

T ^ log T’ 


(105) 


( 106 ) 


which approaches zero independent of D. 

SIGNIFICANCE. From the construction of the sequential measurement given in 
Theorem 14 (see Appendix K) we can see that measurements given by operator-valued 
measures that pairwise commute are not particularly interesting in commimication con- 
text. After the first measurement, subsequent measurements do not furnish any more 
iiiformation about the system imder observation because the first self-adjoint measure- 
ment is a complete measurement in the sense that its eigenspaces are all one- 
dimensional. After the first measurement is performed the state of the q’ mtum system 
is completely determined by the pure state that corresponds to the outcome eige. 'alue. 

It can be seen that there is no mutual information between subsequent measurements 
and the initial unknown state of the system. From the proof in Appendix K it is apparent 
that the second measurement can actually be replaced by a randomized selection of out- 
comes, and the randomized strategy will give the same measurement statistics. But 
we know that we cannot gain performance by a randomized strategy. So one single self- 
adjoint measurement will perform just as well as the full sequential measvirement. Hence 
we have the following corollaries. 

COROLLARY 6. If a quantum measurement is characterized by an operator-valued 
measure, with the measures of all outcomes pairwise commuting, then the measurement 
is equivalent (in the sense that it has the same outcome statistics) to a single self-adjoint 
measurement followed by a randomized strategy. / 

Corollary 6 gives us the following very important result. 

COROLLARY 7. For a measurement characterized by an operator-valued measure 
to outperform all self-adjoint observables, it is necessary that the measures of the out- 
comes do not all pairwise commute, / 
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When the Hilbert space is infinite dimensional but separable, Theorem 14 can be 
extended to handle the situation. In Appendix L we sketch how we can generalize 
Theorem 14. We can then state the following theorem. 

Theorem 15 

If an operator-valued measure is defined on an infinite index set , with values 

as operators in am infinite dimensional separable Hilbert space, and the measures {Qj^} 
pairwise commttte, then it can always be realized by a sequential measurement charac- 
terized by a tree with self-adjoint measurements at each vertex. Sometimes the length 
of the tree is infinite. 

Theorem 1 6 discusses the realization by sequentisil measurements of a pairticular 
class of operator -valued measure. The conditions that characterize this class appear 
rather stringent and it can be argued that the realization of such a narrow class of 
operator-valued measures is not very useful. It turns out, however, that a large class 
of quantum commvtnication problems satisfies these conditions. Exactly how this theo- 
rem can be applied to almost all quantum communication problems will be apparent after 
the discussion of equivalent and essentially equivalent measurements. 

Theorem 16 

If an operator-valued measure is defined on a finite index set (i = 1, . . . , M) 

with operator values in the Hilbert space 3C, and the measures are projector-valued 
except on a subspace -4^ C 3C such that M dim {^} ^ {^}. then it can always be real- 
ized by a sequential measurement characterized by a tree with self-adjoint measurement 
at each vertex. / 

Proof ; Let 

n. = Urn q". Mi = 1 M, (107) 

^ n — 00 

where n is a positive integer. The 11^ are projection operators, and 
M \ 

ije - s n.jK = 

Let 

Rj — Q. — n.» 

Then 


M 





(110) 


(108) 


(109) 
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where P ^ is the projection operator into the subspace , and \ ^ is the identity 


v4( f f 1 ivi 1 

operator on the subspace ^ . The set of projection operators forms 

an orthogonal resolution of the identity in the space 3C. 

That is. 



M 

P + S 
i=i 


"" ^3C' 


( 111 ) 


Figure 7 


Let the first measurement on the system under observation be 
characterized by the projector-valued measures {P 
This measurement can have one of M+1 outcomes. Symbol- 
ically, it can be represented by the tree in Fig. 7. If the 
outcome is represented by a vertex corresponding to one of 
the n., the measurement can stop. If the outcome ends in 
the vertex corresponding to the projector P^ , a second 
measurement is required to complete the sequential measurement process. 

The set of operators sums to the identity operator I^ in the subspace 

and each of the operators R. is nonnegative-definite. Hence they form an operator- valued 
measure on the subspace U/ • By Theorems 2 and 4, there exists on an extended space 
3C ^ Ji , a projector- valued measure |P^j-^_j^ such that 


M 

S P. = I (112) 

i=l ^ 3C'*' 


where I , is the identity operator on 3C , and 


R. = P .. P.P „ . 
1 JK I 


(113) 


By Theorem 5, the minimum dimensionality of this extended space 3C that is required 
is less than or equal to M times the dimensionality of the original space . That is, 


min {dim {3C^}} M dim {^}. 


(114) 


By assumption, 


dim {3C} 3= M dim {^}- 

Hence 

dim {3C} > min {dim {3C^}}, 

and 

^ C3C. 


(115) 

(116) 
(117) 


Therefore it is possible to find a projector-valued measure {Bjlfij in 3C such 
that 


40 



M (118) 

and ~ 

M 

i?i^i = ^3e- (119) 

If the outcome is in the vertex corresponding to , after the first measurement 
we can pe^orm a second self-adjoint measurement given by the projector -valued mea- 
sure as represented by the tree in Fig. 8. By a previous result (see Sec. VIII), 



the operator-valued measure for the path ending in the vertex corresponding to the pro- 
jector is 

- R^. i = 1 M. (120) 

Hence the operator-valued measure Q. is the sum of the measures of two paths, one 
ending in the vertex corresponding to P., the other in the vertex corresponding to II.. 
The whole sequential measurement is represented in the tree in Fig. 9. Therefore 



Figure 9 
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we have a realisation of the given operator-valued measure by sequential measurement. 
Thus we have proved a case in part (b) of Theorem 1. / 

NOra. The condition that M dim d dim {K} can be relaaed i£ more structurea 
on the are given. If we have 


m 

dim{(ft{Rj} « dim{3e}, 


»here « {Hj} ia the range apace of R^. then hy Theorem 7 we can alwaya find a projector, 
va ued extension mK . (Remember that in dealing with infinite dimensional spaces 
caution should be taken in interpreting the results.) 

Corollary 8, which is a useful consequence of Theorem 16, will be needed in Sec- 
tion XII. 

COROLLARY 8. If an operator-valued measure ig defined on a finite index 

set (1 = 1, .... M). with operator values in an infinite dimensional Hilbert space 3C. and 
the measures are projector-valued except on a finite dimensional subspace , then it 
can always be realized by a sequential measurement characterized by a tree with self- 
adjoint measurement at each vertex. / 


Proof ; 

Mdim{.^}<<« = dim{3C}. 

Therefore Theorem 1 6 applies. / 

In Theorem 16 we exploited the property of a special class of operator-valued mea- 
sures that are projector-valued except in a finite dimensional subspace. In fact, this 
fimte dimensional subspace is an 'invariant subspace' for the operator -valued mea- 
sure. If we explore the proportions cf invarir.nt subspaces for an operator- valued mea- 
sure, we can realize a larger class of measures as sequential measurements. These 
results are very important because we shall show in Section XII that there are com- 
munication problems that fall within such a class. 

definition 5. A closed subspace in a Hilbert space JC is called an invariant 
subspace for the operator A if Ax e whenever x e {that is, A uT c Ji)7/ 

DEFINITION 6. A closed linear subspace ^ in a Hilbert space 3C reduces a 
bounded self-adjoint operator A if both and ^ ^ are invariant 

for A. / ^ 


Lemma 1 

H A is a bounded self-adjoint operator, the subspace reduces A if and only 
if is invariant for A. 


i 


Proof: 


(i) If reduces A, by definition ^ is invariant for A. 

(ii) If X e y e Ax e 

Hence 

(Ax. y) = {x. Ay) = 0. (123) 

Therefore, Ay ^ and is also invariant for A. / 

If a subspace ^ reduces A, then tne problem of characterizing the operator A on 
5C reduces to the problem on ^ and and A can be written as 


A = 


P AP 


+ P 


AP 


(124) 


where P , P , 
^ ^ -L 


are the projection operators projecting into ^ and 


i 


. respectively. 


In general, a self-adjoint operator A can have more than one invariant subspace, 
For example, every eigenspace of a self-adjoint operator is obviously an invariant 
subspace. 

r 1 N 

If a set of orthogonal subspaces are invariant for a bounded self-adjoint 


M 

operator A, so that •M, A = 0, for i j, and © Ui . = 3C, where © indicates 

^ J i= 1 ^ 

direct sum, then A can be written 


N 

A = S P „ AP ^ . (125) 

i=l i *^1 

and 


where P is the projection operator into the subspace ^ .. 

i ^ 

For a bounded self-adjoint operator, a useful set of invarieint subspaces is the set 

of eigeiispaces. 


DEFINITION 7. A closed linear subspace is a simultaneous invariant subspace 
of a set of botmded self-adjoint operators if ^ is invariant for each operator 

A^, 1— 1,..., B4. / 

Later we shall show how to find a set of simultaneous invariant subspaces for a 
set of bounded self-adjoint operators. Assume for the moment that given a set of 
bounded self-adjoint operators, we know how to find the simultaneous invariant sub- 
spaces. 

If a generalized measurement given by a set of operator- valued measures 
is given, we can try to find the simultaneous invariant subspaces of the Q.. Let a set 



•V- ''-.H., • ’•f - ■: •- : tS. - ••• •; ri 


I 


43 



of orthogonal subspaces 




Then 


be simultaneously invariant for the set of operators 


N 

Q. = S P ^ QiP ^ > i = 1, . . . , M 
j=l 


N 

j=l •> 


(127) 


where 

^ : for all ij 


(128) 


and 

J=1 J 


Since 


is an orthogonal resolution of the identity, 


it corresponds to some 


self-adjoint measurement. Let the first measurement be characterized by this 
projector- valued measure. Then it can be represented symbolically as in Fig. 10 by 
the initial segment of a tree. 



Each of the N sets of nonnegative-dcfinite operators forms an operator-valued 


measure with values as operators in their corresponding subspace That is, 

Q. . 2: 0 
ij 


( 130 ) 


M 

£ Q.. = P 
i=l 


J{ ' 

J J 


j — 1,...,K, 


(131) 


where I ^ is the identity operator in the subspace ^ 

1 f 1 

If the first measurement given by the projector-valued measure -{P^ j- is 
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performed, the outcome will be in one of the vertices in Pig. 10, Suppose the outcome 
is represented by the vertex corresponding to the projector P ^ , then the second mea- 

^ r 1 M 

surement should be characterized by the operator -valued measure Since the 

operator-valued measure is defined only on the subspace ^ j and we can choose for the 
second measurement any self-adjoint measurement defined on the entire space 3C , under 
suitable conditions the second generalized measurement can be realized by a 

self-adjoint measurement defined on 3C that includes as a subspace and acts as an 
extension space of Specifically, if the operator-valued measures satisfy one of 

the following conditions: 

(i) M dim ^dim {JC} (132) 


(ii) £ dim {(R{q. I} <dim {JC}, 
i=l 


(133) 


then it is possible to find a projector-valued measure with operator values 

defined on the entire space 3C such that when restricted to the subspace US' j will give 
operator-valued measure That is, 


P 


P P 

•^j 



i = 1 , . . . , M 
j = 1 N 


(134) 


M 

£ P.. = It«, for all j. (135) 

i-1 y 


This means that if the outcome is given by the vertex corresponding to P ^ , the rest 

of the measuring process can be realized by a second self-adjoint measurement on the 
system. If indeed each of the N operator-valued measures j = 1, . . . , N satis- 

fies either condition (i) or condition (ii), then we can guarantee, whatever the outcome 
of the first measurement, that the subsequent and final measurement will be a 
self-adjoint measurement. Condition (i) is from Theorem 5 and condition (ii) from 
Theorem 7, 

The two-stage sequential self-adjoint measurement is represented by the tree in 
Fig. 11. The event corresponding to the operator -valued measure Q. is then the N 

possible outcome paths labeled by the projectors {P ^ ’^ij}* 3 “ ^ N as shown in 

Fig. 11. and 


= I. 


J 


N 


= 2 P P P ^ . / 
J=1 *^j 


(136) 
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Hence we have the following theorem. 

Theorem 17 

If an operator -valued measure {Qi}^i has a set of mutually orthogonal simultaneous 

r 1 N 1 l- i 

invariant subspaces such that 

J J 

N 

V ^. = 3C (137) 

j=l . ^ 

A = 0* all i j (138) 

and 

N 

Q = S Q (139) 

1 j=l 13 

where 

s P ^ • all i and j (140) 

J J 

smd if each of the N sets of operators J 1 N satisfies either one or both 

of the two conditions (Eqs. 132 and 133). then the operator-valued measure can be real- 
ized as a sequential measurement characterized by a tree of length two with self-adjoint 
measurements at each vertex. / 

EXAMPLE 9 

(1) If the pairwise commute as in Theorems 14 and 15, then they can be diagonal- 
ized simultaneously by their eigenvectors. These eigenvectors are then one-dimensional 
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simultaneous invariant subspaces. Such operator -valued measures satisfy the conditions 
of Theorem 17 and therefore they permit a realization by sequential measurements. 

(2) The measure in Theorem 16 also satisfies the conditions of Theorem 17. The 
finite dimensional subspace Jt on which the Q. are not projector-valued is a simulta- 
neous invariant subspace for the set of measures The projector-valued part 

of the measures can be realized by a single self-adjoint measurement. The non 
projector-valued part is separated out because it is within a finite dimensional simul- 
taneous invariant subspace. This, in turn, permits a sequential measurement realiza- 
tion, as given in Theorem 16./ 

A natural question to ask is, " Do most operator-valued measures encountered in 
quantum communication possess simultaneous invariant subspaces?” If the answer is 
negative, then sequential measurement will be of limited use in the realization of mea- 
surements in quantum communication. We are not yet in a position to answer this 
question fully. In Sections XI and XII, we shall consider 'equivalent classes' of mea- 
surements. In quantum communication problems most of the generalized measurements 
have equivalent measurements that possess simultaneous equivalent subspaces, and 
almost all quantum measurements of interest can be done sequentially. This issue will 
be discussed in detail in Section XII. 

In lieu of conditions (i) and (ii), we want to find in some sense the ' finest' decompo - 
sition of the Hijbert space 3C into simultaneous invariant subspaces. The reason for 
a 'finest decomposition* (by which we mean that the dimensionalities of the subspaces 
are as small as possible) is simple. If the dimensionality of each of the subspaces ^ . 
is made as small as possible, in a loose sense we have more available dimensions in 3C 
for an extension. It is possible to show that there is a construction procedure to find 
a 'finest decomposition' and this decomposition is unique. The main statement is given 
in Theorem 18 and an outline of the proof is given in Appendix M. 


Theorem 18 

For a set of self-adjoint operators {T^}^ ^ it is possible to find a unique 'finest' 
set of simultaneous invariant subspaces that are pairwise orthogonal and 


N 

T = 2 P„ T Pc . all a G A. 
c S. « S. 

EXAMPLE 10 

We make use of the measure in Example 5, except that we use a Hilbert space 3C j 
with one extra dimension spanned by the vector |f). Let {] s^) span a two-dimensional 
subspace of 3C j orthogonal to j f). Define 




i = 1,2 


(142) 
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(143) 


where 2 jf) (f|. 

1 

2 
3 

Figure 12 

The measurement {Q^} can be realized by the sequential measurement shown in 
Fig. 12./ 
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XI. EQUIVALENT MEASUREMENTS 


Very often in quantum communication two measurements characterized by different 
operator -valued measures will yield the same performance. For any given quantum com- 
munication problem (whether it be a detection or an estimation problem) it is possible to 
categorize the set of all generalized measurements into 'equivalent classes' of measure- 
ment, so that every measurement of the same equivalent class will give the same per- 
formance. 

Let the received information-carrying quantum system be described by the set of 
density operators assume that there exists a set of simultaneous invariant 

subspaces such that 


and 


N 

p s z Pq P„Pq , 
Pc S. c 


N 

Z Pq = I™. 
i=l ®i ^ 


>c4o G A, 


(144) 


(145) 


Let {Q an operator -valued measure corresponding to some generalized 

measurement under consideration, where B is some index set for the outcome. 

Given that the received quantum system is in an arbitrary state given by the density 
operator p^, the probability of getting the outcome P when the measurement is per- 
formed is given by 

Pr[pl«] = Tr{p^Qp} 

= Tr{ S Pg.Pa^S.^^P^ 

1 = 1 1 I 

1 = 1 1 1 
= Tr{p^Qp}. all peB, 


(146) 


where 


N 




loip G B. 

In (146) the identity Tr {ab} = Tr {ba} has been used. 


( 147 ) 
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The set of operators {Qp}| 3 ^B has the following properties, 
Qp>0. all peB, 


Z 

peB 


A 



N 

2 2 PrQqPs 

PSB i=l ^ 


i 


( 148 ) 


N 

= 2 Pg ( 2 Q J Pg 
i=l ‘’i PGB P 1 

N 

= ^s.Vs. 

1=1 1 1 


= I 


3C' 


(149) 


There the set of operators tep}peB forms an operator -valued measure corresponding 
to a generalized measurement which will give the same performance as the measure- 
ment characterized by the measure {Qp}p^B' this sense the two operator -valued 
measures correspond to ' equivalent measurements ,' and they belong to the same equiv- 
alent class of measurements. Note that equivalence is established only with respect to 
the given structure of the density operators {p } _ 

r A I ® <Z t. A 

The measurement corresponding niay have an advantage over the mea- 

surement corresponding to {Qplpgg, since it may have a 'finer' decomposition into 
invariant subspaces, and this would facilitate realization by sequential measurements. 


COROLLARY 9. 
IM 


In an M-ary detection problem when all of the density operators 
pairwise commute, they can be diagonalized simultaneously. If ( ^ is 

if 


their set of orthonormal eigenvectors whicn spans 3C, for any operator -valued measure 
the measure 





is an equivalent measurement and the Q. pairwise commute. By Corollary the mea- 
surement is equivalent to a single self-adjoint measurement followed by a randomized 
strategy. By Corollary 7, this measurement at best is equal in performance to some 
self-adjoint measurement. Hence the optimal measurement for the M-ary detection 

problem with pairwise commuting density operators is a self-adjoint operator. / 

17 

Helstrom has proved this result by using a different method. 
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XII. ESSENTIALLY EQUIVALENT MEASUREMENTS 


We have discussed 'equivalent classes of measurements' in the sense that when two 
measurements belong to the same equivalent class they give exactly the same perfor- 
mance. The decomposition into simultaneous invariant subspaces is useful in realizing 
generalized measurements by sequential measurements, utilizing the procedure pro- 
vided by Theorem 17. But not all generalized me-^su^ements can be realized in this 
fashion, and in some cases we have to use the realization by adjoining an apparatus. If 
the Hilbert space that describes the states of the information-carrying quantum system 
is infinite dimensional but separable, then given any arbitrary operator-valued measure 
that is not realizable by a sequential measurement, it is possible to find a sequential 
measurement whose performance can be arbitrarily close but not equal to that of the 
'unrealizable' measurement. We shall show this result for the quantum detection prob- 
lem and then for the estimation problem. 


Theorem 19 


Given a generalized measurement characterized by an operator-valued measure 
{Qf}^! for an M-ary quantum detection problem with a probability of correct detection 
Pr[C]]» if the Hilbert space that describes the state of the received information-carrying 
quantum system is infinite dimensional but separable, then for any arbitrary € > 0 no 
matter how small, there is a sequential measurement characterized by the operator- 
valued measure {Q.}^, that will give a probability of correct detection Pr[C 2 ]* such 
that 


Pr[Cj]-Pr[C2][ <e./ 


(150) 


Proof ; Let the received quantum system be in the state described by the density 
operator if the message is sent with a priori probability p.. The probability of 
correct detection for the generalized measurement is 


M 


Pr[Cj] = ^2 p. Tr {p.Q^}. 


(151) 


Since all the p. are trace class operators, they are compact operators. (An operator T 

is said to be compact if it maps bounded sets onto sets whose closures are compact.) 

Hence they each have a set of eigenvalues associated with a set of complete eigenvectors 

20 

(for a proof see Segal and Kunze ). We want to find a finite -dimensional subspace 
such that given a density operator p. and e > 0 no matter how small, 

1 ^Tr{Pg p.Pg}> 1 - e. (152) 

i i 

If the range of p. is finite dimensional, S. can be taken to be the range space so that the 
trace is one. If the range of p. is infinite dimensional, we can find by exploiting the 
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property of as a compact operator that "the set of eigenvalues of a compact self- 
adjoint operator is a sequence converging to zero."^® I^t be the eigenvalues 

of p^i then 


lim \ 
n— » 


n 


= 0 


(153) 


and 


00 


S 

n=l 


1 = Tr {p.}. 


(154) 


Hence there is a finite set of eigenvalues such that 
I ^ S >. > 1 - 


(155) 


Let S. be the finite -dimensional subspace spanned by the eigenvectors corresponding to 
this finite set of eigenvalues. Then 


1 ^ Tr {p p P } = s \ > 1 _ e . 

h ^ ®i ne^ " 


(156) 


Let the set of subspaces be so chosen for the set of density operators • It 

is clear that each subspace is invariant for tiie corresponding p^, since S. is a finite 
sum of the eigenspaces of p.. Let ifC - S. = S?. Then 

i 1 i 




i = li 


,M 


(157) 


and 



M 

Let S = V S. . Then 
i=l ^ 


= Tr{p p P }<e. 

Sf Si 


M 


dim {s} < S dim {s.} < <». 


i=l 


(158) 


(159) 


Hence S is finite dimensional and 

^^eCPi^ J = Tr {P ^,p.| < e, all i = 1, . . . , M. 
S S d 


(160) 


If Is an operator- valued measure with a probability of correct detection Pr[c ]. 

we claim that the operator- valued measure {Q. = PgQ.Pg + p.P has an error per- 

formance Pr[C 2 ] such that | Pr[Cj ] - Prfc^] | < € . Then we have 
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(161) 



But the second term on the right is positive, and 



Therefore 

Tr {PiQi}*-Tr{Pg_P.PsQi}<e- 


(162) 


(163) 


whereas 

Tr {PgPiPs^^i^ U (S-S.)Pl^Sj U (S-S.)^^!^ 

= Tr {(Pg. +Ps-S.^Pi<^S. 

= Tr {Pg.PiPs.S^ + 

+ Tr {Pg.PiPs-S.'^J {Ps-S.^i^Sj^i^* 


(164) 


Since S. is invariant for p., Pg commutes with p. and Pg Ps-s. = 

1 •*■ 1 1 

terms in (164) are zero. Since both p. and are nonnegative -definite, the 
is nonnegative. Hence 


the last two 
second term 


0 «Tr {p.Qj} - Tr {PgPjPgQi} 

= Tr {p.Q.} - Tr {p p.Pg Q.| - Tr {Pg_g. PiPs-S.S^ 
i i i 1 1 1 

<e, Mi=l M. ( 165 ) 


Therefore 


lPr[Cj]-Pr[C2]l 


S p.(Tr {p.Q.} - Tr {PgPiPgQi} ' Tr {PiPjP 
i=l ® 

S pj Tr {p Q.} - Tr {PgPiPgQil " Tr {PiPjP 
.^1 1 1 ^ b 


The operator- valued measure {Q^}^! can be realized as a two-step sequential measure- 
ment. The first measurement will have two branches. The projectors corresponding 

to them are {Pg and I - Pg = P p}* 

” S 
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Given that the outcome is the vertex corresponding to P^, the second measurement 

® r 

has to have the same result as the operator-valued measure iPs*^i^Sn=r mea- 

sure is a resolution of the identity of a finite -dimensional space Si and by Theorem 5 it 
permits an extension to a projector-valued measure in any infinite -dimensional space 
that contains S as a subspace. The original Hilbert space 3C can be taken to be that 
subspacei so that the second measurement is realizable by a self-adjoint measurement 
associated with the projector-valued measure such that 


M 

S 

i=l 


^i = ^JC’ 


(167) 


Ps^^i^s = PsVs' ^1'=' 


(168) 


When the outcome is in the vertex corresponding to the projector P (which would occur 

with very little probabilityt < €), the second measurement can be done by a random 
selection of one of the M messages with probability p.» i = It . . . r M. Or we may con- 
sider the whole event to be an outright error and call it an erasure i as in an erasure 
channel. 

The sequential measurement is represented by the tree in Fig. 13. / 



Figure 13. Sequential measurement modeled as an M-ary erasure channel. 


Thus we have shown that given any arbitrarily small e > 0i we can find a sequent y.l 
measurement that will have performance within e of that of a given generalized 

measurement In this sense we call the two measurements {Qj}^j and 

essentially equivalent measurement . 

If we omit the first stage of the sequential measurement and only perform the self- 
adjoint measurement I the performance vdll not change very much* since the 

resolving power of the first measurement is small. The performance 


Pr[C3] 


M 


S p.Tr{p.n } 
i=l ^ ^ ^ 


(169) 
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i 



(170) 


has the property 

|pr[Cj]-Pr[C3]| < e. 

Hence the single self-adjoint measurement is also essentially equivalent to the general- 
ized measurement; and we have the following theorem. 

Theorem 20 

Given a generalized measurement characterized by an operator-valued measure 
{Qi}^l for an M-ary detection problem with a probability of correct detection Pr[Cj], 
if the Hilbert space that describes the state of the received quantum system is infinite 
dimensional but separable, then for any arbitrarily small e > 0 there is a self-adjoint 
measurement that will give a performance Pr[C 3 ] such that |Pr[Cj] - PriC^Jl < e./ 

The proof is straightforward and is omitted. 

From the proof of Theorem 19. we can see that the condition that the Hilbert space 3C 
be infinite dimensional is not absolutely necessary. Whenever the dimensionality is big 
enough, Theorem 19 holds. The exact dimensionality depends on the operator-valued 
measure and on the set of possible density operators, in a conceptually straightforward 
but mathematically complicated way. Although it is within the realm of the mathematics 
developed in this report to state this exact dimensionality, the result is omitted because 
of its complexity and dubious usefulness. 

SIGNIFICANCE. From Theorem 20, we see that for each generalized measurement 
we can find a conventional observable that gives essentially the same detection perfor- 
mance, if the state of the system is described by an infinite -dimensional space. In opti- 
cal communication, the natural Hilbert space that should be used is the space spanned 
by the photon number states i.|n>}^_Q which is infinite dimensional. A very important 
question then arises, "In optical communication should we consider generalized mea- 
surements at all? " It may be argued that since conventional observables will do almost 
as well in detection problems, generalized measurements should not be considered. In 
some cases, however, the optimal measurement is a generalized measurement. Although 
there are observables that give performances arbitrarily close to it, none actually 
achieves it. In loose mathematical language, it can be said that if we consider the per- 
formance (probability of error) as a form of weak topology on the set of all observables, 
that set is not a closed set. The optimum measurement may not be in the set; hence, 
it will not be feasible sometimes to find an optimum measurement within the set of 
observables. 

We shall now prove the equivalence of Theorems 19 and 20 for the estimation prob- 
lem. The conditions in Theorem 21 are sufficient but not necessary, but they are gen- 
eral enough that most problems satisfy these conditions or can be approximated by them. 


y 
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Theorem 21 


Given a measurement characterized by a generalized resolution of the identity 
^ complex parameter estimation problem with a mean-square error Ij, if 
the Hilbert space that describes the state of the received quantum system is infinite 
dimensional but separable, then for arbitrarily small € > >, there is a self-adjoint 
measurement that will give a mean-square error I 2 . such that 

^ ( 171 ) 

if the following sufficient conditions are satisfied; 

(i) The probability density function for the complex parameter a, p(o) has a com- 
pact support sec. (The support of a complex function f on a topological 
space X is the closure of the set {x;f(x) * o}.) 

(ii) p(c) is continuous. 

(iii) The 'modulation' is uniformly continuous, which means that if a sequence {o.} 

converges to a, the sequence of density operators {p^ } also converges to p . 
in trace norm. That is, * 


Tr{|p^ 0' 

i 


( 172 ) 


and if I a -a. I < 6 , then Tr {[ p^ - 1 1 < e for all values of a e S. 

(iv) The generalized resolution of the identity ® (weakly) and uniformly 

continuous first derivative. That is, 

^a=i^c (17 3) 

has the property that for any operator A with Tr {| a|| < » and if a sequence 
{a.} converges to a, 

Tr {aG^_} - Tr {aG (I 74 ) 

and given any € > 0 . there exists 6 > 0 such that la. -aj <6 implies 

I Tr {aG^ }■ - Tr (aG^II < e, >p(c,c., A./ (175) 

(Note that = /g / Tr {p^G^ ,} | a-e ' [ ^ p(a) d^a 'd^e .) 

The proof of Theorem 21 is given in Appendix N. 

The performance measure in Theorem 21 does not have to be the mean-square error. 
It can be any measure m(c.a') that is uniformly continuous in both variables a and a' on 
the support S of p(a ) . 

The uniform continuity conditions make the proof much simpler, but the theorem is 
provable by requiring that the integrand be measurable. The fact that p(c) has compact 
support is used to show that a finite number of the e.(M) are required to approximate the 


i 


I 

I 

* 

I continuous range of a £ S. and thus it becomes an IVI-ary detection problem. Almost 

5 every density function p(a) has all probability confined to a bounded region. Even if it 

I does not have compact support, the tail of the function can be truncated to make the sup. 

I port compact. 

I EXAMPLE 1 1 

: We now give an example of a ternary detection problem where an operator-valued 

‘ measure characterizes the optimal measurement. Although we can find self-adjoint 

j measurements that perform arbitrarily close to the optimal performance, none actually 

achieves it. 


Consider an infinite -dimensional Hilbert space 3C that is the union of an infinite num- 
ber of two-dimensional orthogonal subspaces {S.}* , such that 

J ^ 


00 

3C = V S.. 
j=i J 


(176) 


For each subspace S., let three vectors {| have the same symmetry as those 

in Example 3 (see Fig. 1). Consider the three density operators. 


P 


i 


“ 1 
Z -ir 

j=l 2^ 



i = 1,2,3. 


(177) 


The optimal measurement is given by the operator-valued measure 



i = 1, 2. 3 


(178) 


which gives a probability of correct detection of 2/3. 

Since the density operators have nonzero though diminishing eigenvalues for all sub- 
spaces. we cannot truncate the density operators by making a first measurement to pro- 
ject it into a finite- dimensional subspace without losing some small but nonzero 
performance. 


57 





I 


XIII. SIMULTANEOUS GENERALIZED MEASUREMENTS 


Thus far. we have extended the notion of quantum measurements to generalized mea- 
surements. The conventional view that measurements are observables corresponding 
to self-adjoint operators entertains the concept of simultaneous measurable quantities. 
Two quantities are said to be simultaneously measurable if and only if the self-adjoint 
operators corresponding to them commute. Thus the quantities A, B are simultaneously 
measurable if and only if [A. B] = AB-BA = 0. Equivalently, if the projector-valued mea- 
sures and are the resolution of the identities of A and B. they are 

simultaneously measurable if and only if there is a third projector-valued measure 
{Rkiksjf that 

(i) n = S R. (179) 

^ kejr. 

and for disjoint subsets ^ of K, so that U Jf . = X. 

1 1 

Vje/. (180) 

3 


and for disjoint subsets {X'}.- ^ 


U x\ = jt. 
}%/ ^ 


of so that 


(181) 


Note that conditions (i) and (ii) are simultaneously satisfied if and only if the mea- 
sures {n.j.lPj}- pairwise commute. That is. 


n^P^-P^n^ = 0. alli.j. 


(182) 


We must now modify the notion of simultaneous measurements. 

In order to determine if two operator-valued measures correspond to simultaneously 
measurable quantities, we look at their respective projector-valued extensions. On a 
common extended Hilbert space 3C if the respective projector-valued measures com- 
mute . then we say that the two operator- valued measures are simultaneously measur- 
able. This definition, although basic, is sometimes not very useful, since it requires an 
examination of the projector-valued measures on a common extension space. Without 
much mathematical difficulty, we can define simultaneous measurability directly on the 
operator- valued measures themselves, which is the thrust of the following theorem. 

Theorem 22 


Two generalized measurements, characterized by the operator -valued measures 

. are simultaneously measurable if and only if there is a third 
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(183) 


generaliz;ed measurement, characterized by the measure so that 


(i) S. = 2 Q.. 

‘ iejT. 

and disjoint subsets of so that 

U JfT. = Jt. 

(ii) T. = S Q 

^ jejfj ^ 

for all j e ^ , and disjoint subsets {jTlI.g of Jf' so that 

U jT! = JT./ 

.iS/ ^ 


(184) 

(185) 


(186) 


The proof of Theorem ZZ is given in Appendix O. 

As we note in Appendix O, without loss of generality we can require for simultaneous 
measurability that there is a measure such that 

allisy (187) 


Si = 2 Q . 


T = 2 Q 

•1 i(E^ ^ 


all j e J, 


(188) 


In some sense the measurement la a finer grain measurement than both the mea- 
surements {Sil and {Tj}, and the outcome statistics of both are obtained from the {Qij} 
measurement by coarse- graining over its outcome statistics. 

When the measures {s.}, {t^} pairwise commute, they are always simultaneously mea- 
surable and is easy to find If we define 


Qy “ all i, j. 


(189) 


{Qij} will satisfy all necessary conditions for simultaneous measurability. 

In Theorem Z3 we give a sufficient but not necessary condition for the simultaneous 
measurability of two operator- valued measures. 

DEFINITION 8. The anticommutator of two operators A, B is defined as 

[A,B]* = AB+BA./ (190) 

Theorem 23 

Two operator-valued measures {^i}ie^' {"l'j}jG^ simultaneously measurable if 
all anticommutators of the form [Si, Tj]’*' are nonnegative -definite. That is. 

[Si-T.]" = SiT. + T.Si ^0. alli.j./ (191) 
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Proof: Define 




r Q = S ^(ST. + T.Sj) = I. 

iey isy ^ ^ ^ ^ ^ 

je/ P-J 

Then {Q. •} is an operator-valued measure with 
ij 


S. = S Q... 

‘ js/ ' 


all i 


T. = 2 Q„. 


all j. 


( 192 ) 

(193) 


(194) 

(195) 


Hence {s.},{t } are simultaneously measurable./ 

It is not easy in general to find the 'finer grain’ measurement {Q^.}. In Appendix P 

we provide a generally very useful construction for the mei sure {Qy}* 

SIGNIFICANCE. We have shown that two simultaneously measurable generalized 
measurements correspond to a single 'finer grain' generalized measurement. Hence 
we shall not get better performance for quantum communication problems by considering 
simultaneously measurable generalized measurements. It is always sufficient to con- 
sider single generalized measurements, since this class also encompasses simultaneous 
generalized measurements. 
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XIV, AN ALTERNATIVE CHARACTERIZATION OF 
GENERALIZED MEASUREMENTS 


Thus far, we have characterized generalized measurements virith operator-valued 
measures. When the operator-valued measure corresponding to a particular measure- 
ment is given, together with the quantum state of a system, the statistics of the outcome 
of that measurement is uniquely specified, in the sense that the probability density func- 
tion (or distribution function) for the outcome is given by Eq. 12, But we can equivalently 
specify the measurement statistics by giving the mean and all higher order moments of 
the outcomes. The probability density can be specified uniquely through the moment- 
generating function (or characteristic function). The specification of moments instead 
of probability densities provides an alternative means of characterizing generalized 
quantum measurements. The operator-valued measure characterization is independent 
of the particular quantum state of the system. It is universal in the sense that Eq. 12 
will give the correct probabilities if we use the correct quantum state for the system. 

To characterize generalized measurements using all order moments of the outcomes, 
the characterization should also be universal, so that the specification will be correct 
for ^ possible quantum states of a system. We shall now propose such a character- 
ization which is equivalent to the characterization by operator-valued measures. We 
suspect that the most likely use for this characterization is in estimation problems, 
since moments are explicitly involved. 

14, 1 Another Characterization of Generalized Quantum Measurements 

Suppose we have a quantum system in an arbitrary quantum state js), and a gener- 
alized measurement is to be performed on it. Without loss of generality we assurr e 
that the outcome is a real number \. We characterize the generalized measurement by 
a sequence of bounded self-adjoint operators {A where A = I s identity operator, 

and the n'^"-order moment of the measurement statistics is given by 

e{\"} = <s|Aj^|s> n = 0,1,2 (196) 

where e{ • } denotes taking expectations. If the state is described by a density oper- 
ator p, 

e{\"} = Tr{pA^}. {197- 

A trivial example is when there is a self-adjoint operator A such that A^ = a”, for 
all n, since the measurement is simply the one characterized by the operator A. 

Not every sequence of self-adjoint operators corresponds to a generalized measure- 
ment. For example, when A^ is not nonnegative -definite the second moment of the out- 
come can have negative values, which is absurd. So a necessary condition for a sequence 
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of operators to correspond to a generalized measurement is that its even indexed oper- 
ators be nonnegative- definite; that is> 

A 2= 0, n even. (198) 

n 

We shall give a necessary and sufficient condition on the sequence {a^^} so that it 
characterizes some generalized measurement. It is obvious from the previous discus- 
sion of generalized measurements that there must exist on an extended Hilbert space 
D 3C, a self-adjoint operator A corresponding to a conventional measurement such 

that 


•^n “ ” 


(199) 


if {a } corresponds to a particular generalized measurement. 

Whenever such an operator A exists on some extended space 3C , we are willing to 
say that {a } characterizes a generalized measurement. Then the necessary and suf- 
ficient condition for the sequence {a^} to characterize a generalized measurement is 
the same as the condition for {a^} to have an extension A that satisfies Eq. 199. When 
we have the observable A defined on an extended Hilbert space 3C . the measurement 
can be realized by embedding JC*" into a tensor product Hilbert space of 3C and some 
apparatus space (see Sec. V). 


14. 2 Necessary and Sufficient Condition for the Existence of an 
Extension to an Observable 

We now give a necessary and sufficient condition for a sequence of self-adjoint oper 
ators to have an extension such as we have just discussed. 


Theorem 24 

Suppose {a^} , n = 0, 1. 2 is a sequence of bounded self-adjoint operators in a 

Hilbert space 3C satisfying the following conditions: 

(i) For every polynomial 

p(\) - + aj\ + + . . . + a^^X*^ (200) 

with real coefficients assuming nonnegative values in some bounded interval -M « X « 
M, we have 


a^A^ + ajAj + ^^ 2.^2 ^ 


+ a_A 5 0. 
n n 


( 201 ) 


(ii) A^ = I. 

+ 

Then there exists a self-adjoint operator A in an extension space 3C 


( 202 ) 
such that 
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(203) 


A„=P3cA*^|3C 


Furthermore, we require that JfC'*’ be minimal in the sense that it be spanned by ^elements 

of the form A"f, where f e 3C and n = 0, 1, 2 In this case the structure {ifC , A,3C} 

is determined to within an isomorphism, and we have 


||A||<M./ <204) 

The proof of this theorem has been given by Riesz and Sz.-3SIagy.^^ We shall outline 
only a particular part of the necessity proof because it correlates this formulation of 
the generalized measurement with the operator-valued measure characterization, which 

we have considered earlier. ^ 

Observe that if A is a self-adjoint operator |j A|| « M on a Hilbert space 3C D 
A will have an orthogonal resolution of the identity such that 


A = 



KdE^, 


where {EjJ is a projector-valued measure and 


(205) 


a" = K"dE^. n = 0,l,2, ... 

When we project a” back into the subspace 3C , we have 


(206) 


Pjc A'’Pjc = 

where {f. s P.^E.. P.^}, in general, is an operator-valued measure. Hence we see that 
if a sequence of bounded self-adjoint operators satisfies the conditions of Theorem 24 
there will always be an operator -valued measure so that the sequence of operators can 
be represented in the form of Eq. 207. 


DISCUSSION. We have provided two essentially equivalent characterizations of 
generalized measurements. It is purely a matter of convenience to choose one partic- 
ular characterization rather than the other. Since the moment characterization involves 
the powers of the eigenvalues of the measurement more explicitly, it may be more use- 
ful in quantum estimation problems. From the characterization of sequential measure- 
ments, however, the operator-valued measure characterization appears to be more 
convenient. 
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XV. CONCLUSION 


We have provided two realizations of generalized measurements. The first realiza- 
tion. involving an apparatus, guarantees a realization for every operator- /alued 
measure. The second method of realization, by sequential measurements, provides 
realization only for several restrictive classes of generalized measurements. But we 
have shown that for a large class of detection and estimation problems sequential mea- 
surements with performance arbitrarily close to the operator -valued measures can be 
found. A very striking and important result is that in both detection and estimation prob- 
lems. under reasonable assumptions, generalized measurements can be replaced by 
self-adjoint observables, with arbitrarily close though sometimes not equal perfor- 
mance. 

From the characterization of sequential measurements, we have noted the important 
fact that measurements characterized by commuting operator -valued measures at most 
can perform as well as self-adjoint observables. In general, they correspond to a 
single self-adjoint measurement followed by a randomized decision. 

Simultaneous generalized measurements are shown to be equivalent to a single 'finer 
grain' generalized measurement. Hence there would be no possibility of improving 
performance by considering such measurements. 

A different approach to characterizing generalized measurements has been proposed. 
It is possible that this characterization will be more useful in estimation problems. 
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Part II, Role of Interactions in Quantum Measurements 


XVI. INTRODUCTION 

In Part I we characterized quantum measurements with a rather abstract mathemat- 
ical language. We claimed that every quantum measurement corresponds to some self- 
adjoint operator on a Hilbert apace that can be larger than the original Hilbert space 
describing the state of the system. Equivalently, we said that quantum measurements 
can be characterized by operator-valued measures defined on the system Hilbert space. 

At various instances, notably in the discussion of sequential measurements, we also 
assumed that the converse is true - that every operator -valued measure can, in prin- 
ciple be physically realized as a measurement. This view is similar to the popular 
concept that the set of all measurable quantities forms a von Neumann algebra gener- 
ated by the set of all self-adjoint operators corresponding to the conjugate coordinates 
of the system, with each member of the algebra being a bounded function of the not nec- 
essarily commuting coordinate operators. For example, the von Neumann algebra gen- 
erated by the positive operator X and momentum operator P is the set of all bounded 

operators on the space of square integrable functions L (X, p), where p is the Lebesgue 
21 

measure. 

There is actually no systematic realization procedure for implementing abstractly 
characterized measurements. For the most part, experimentalists measure physically 
a very small subset of the set of all abstract measurements. In many cases, for example, 
the only known physically measurable quantity is the energy of the system. Some of 
these measurements are performed on the system alone. An example is photon 
counting in the direct detection scheme of optical communication. Other measure- 
ments are performed with the aid of an apparatus that interacts with the system under 
observation, the final measurement being made on either the apparatus or the^ composite 
system. An example of this is heterodyne detection in optical communication where a 
local oscillator field interferes optically with the received field before the combined field 
is detected by means of an energy measurement. Many measurements are in this second 
category, and frequently the final measurement is performed only on the apparatus, and 
the interaction play . the important role of transferring information from the system to 
the apparatus. 

If we are faced with the problem of trying to realize physically a certain abstract 
measurement that does not correspond to any known implementable measurement, it 
would be fruitful to consider different apparatus that are compatible with the system 
under observation. (By compatible, we mean that the apparatus can somehow be ^- 
pled to the system.) We know how to measure some quantities in these apparatus, and 
by an interaction between the apparatus and the system, brought about by suitable cou- 
pling, information about the state of the system is transferred to the apparatus. Thus 
by performing a physically realizable measurement on the apparatus, we obtain the 
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same information about the system as in the abstract measurement. Hence the task of 
realizing the abstract measurement is now transformed to the task of finding an appro- 
priate interaction to transfer the information from the system to the apparatus. While 
we cannot guarantee that any interaction can be brought about by some physically real- 
izable coupling, this method is superior potentially to most ad hoc procedures, and is 
well worth considering. 

Thus the role of interactions in quantum measurements is our central theme in 

Part II. The importance of such interactions has been discussed by many authors (for 

1 8 22 4 

example, von Neumann, d'Cspagnat, and Yuen ). Scant attention has been paid to 
the problem of implementing arbitrary quantum measurements, although d'Espagnat^^ 

4 

and recently Yuen have made some progress along these lines. 

Interactions are also important in sequential measurements. The effectiveness of 
sequential measurements hinges on the very crucial nature of the self-adjoint measure- 
ment at each step. In order for subsequent measurements to add information about the 
original state of the system, previous measurements all must correspond to self-adjoint 
operators that have degenerate eigenspaces. Otherwise, if one of the previous measure- 
ments is 'complete' (i. e. , if each of the eigenvalues of its associated self-adjoint oper- 
ator corresponds to only a single eigenvector), after that measurement the system will 
be in a known pure state, and the outcome statistics of subsequent measurements will 
depend only on this state rather than on the original state of the system; hence, no fur- 
ther information can be gained. Sometimes the dimensionality of the Hilbert space is too 
small for an 'incomplete' measurement. For example, if the system is two-dimensional, 
any measurement on this system must either be a complete measurement or a trivial 
measuremeiit that adds no information (e. g. , the measurement corresponding to the 
identity operator). We have encountered such a situation (see sec, 8. 1) where an appa- 
ratus is brought to interact with the system so that part of the information is transferred 
to the apparatus for the second measurement. Hence via interactions the apparatus (or 
many apparatus) can be used as an information buffer for future measurements. 

In Section XVII we examine several classes of measurements with interactions 
involved. Tn particulsir, we address the problem of the physical realization of am 
abstract measurement, by specifying the interaction that is required to transform the 
joint state of system and apparatus, so that after the interaction, by performing a known 
implementable measurement, the outcome statistics are identical to the abstract mea- 
surement. The interaction will be characterized by specifying the unitary transforma- 
tion U which summarizes its effects. In Section XVIII interactions are stud'“d in 
detail and the unitary operator U is used to find the interaction Hamiltonian Hj, which 
can then be expressed in terms of the generalized coordinates of both the system S and 
the apparatus A. This expression will suggest the coordinates of S and A that should 
be coupled and how they are to be coupled together. 

Section XIX takes into account the constraints of physical laws aind elim .nates inter- 
actions that are not 'allowable.* 
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XVII. SPECIFICATION OF THE INTERACTIONS FOR REALIZATION 
OF QUANTUM MEASUREMENTS 

We shall investigate the properties of two familiar classes of measurements, both 
of which involve an adjoining apparatus. By examming the interactions that take place 
before the measurements are made, we shall give specific suggestions for physical real- 
ization of abstract measurements. 

Class I. The system S under observation is brought into interaction with an appara- 
tus A, and then a self-adjoint measurement is performed on A alone. / 

Class II. The system S under observation is brought into interaction with an appara- 
tus A. and then two self-adjoint measurements are performed, one on S. the other 

on A./ 

We could also consider the class of measurements with the final measurement per- 
formed on S alone, but by symmetry that is equivalent to the Class I considered here. 

Whenever there is no known implementation of an abstractly characterized measure- 
ment, it is fruitful to consider measurements in Classes I and II. If there is a set of 
quantities tliat we know how to measure on A (or on both A and S). we shall try to 
implement an interaction between A and S, so that essentially by measuring one (or 
more) of the measurable quantities on A (or on both) we shall have measured the desired 
abstract measurement. After finding a compatible apparatus with known measurable 
quantities, the first step is to find the required interaction and decide whether there is 
any coupling between A and S that will bring about that interaction. The following prob- 
lem for measurements in Class 1 is useful for detection problems. A modified prob- 
lem for estimation problems will be offered li. er. 

PROBLEM 1 

Given a measurement abstractly cnaracterized by the operator-valued measure 

{Qiiiey find 

(i) an apparatus with a Hill'ert space 3C^, 

(ii) a density operator for the apparatus, 

(iii) an interaction between S and A. whose sole effect is summarized by a unitary 

transformation U on the joint state of S+A, 

[The fact that an interaction can be summarized by a unitary transformation will be 
discussed in Sec. XVIII,] 

(iv) a measurable observable on A alone that is characterized by the projector- 

valued measure {nl._ which forms a resolution of the identity on the space OC^. that 
is. S n. = Ijc (so the set of measures {P. = H. ® is a resolution of the iden- 

tity^(5*fhe space 3Cg ® 3C^ such that P. = 


(208) 


(V) Q. = Tr^{p^uV.u} 

= Tr^{p^u‘*'(n. 0 l3Cg)u}, M i e ^ . 

DISCUSSION. We know (see Sec, V) that we can find the ap-'.aratus space 3C^ and 
the density operator p^. Since the measurement is being perio med on the apparatus, 
the apparatus space 3C^ must have dimensionality greater than or equal to the dimen- 
sionality of the minimal extension space 3C^ of the measure {Q.}. Let the 

projector-valued extension of {Q^} on the space 3Cg 0 3C^. Hence we want to find an 
apparatus U such that 

I 

Rj = uV.U, all i £ Jf. (209) 

Rj and P. are then said to be unitary equivalent. (The subject of unitary equivEilence 
*• * 10 11 23 

has been studied extensively ' ’ .) A necessary and sufficient condition for the two 

measures {r^} and {p.} to be unitary equivalent is 

dim{(ft{R J} = dim {(H{p.}}, all i £ ,/. (210) 

where <R{*} denotes the range space of the operator in braces. 

If this condition is satisfied, there will be a set of isometric mappings from each 
of the range spaces (R{r.} onto the range spaces (R{P|} for all i, and by combining these 
mappings we can specify the tmitary operator U. (Note that unless all range spaces are 
one- dimensional, the isometries and hence the unitary operator U will not be unique.)/ 

We have a similar problem for measurements of Class II. Notice in both classes I 
and II that we assume implicitly that neither the system nor the apparatus is destroyed 
by the interaction; after the interaction, parts of the composite system can still be iden- 
tified as the system and the apparatus. In Class II we have a slightly more stringent 
assumption. We assume that S and A in some sense are uncoupled after interactions, 
and measurements on S will not affect the state of A or vice versa (although the mea- 
surement statistics of the two subsystems will be correlated because of the interaction). 
We present the following problem for measurements in Class II. This is a detection 
probli m. 


PROBLEM 2 

Given a measurement abstractly characterized by the operator-valued measure 

(i) an apparatus with a Hilbert space 3C^, 

(ii) a density operator for the apparatus, 

(iii) an interaction between S and A, whose sole effect is summarized by a unitary 
transformation U on the joint state of S+A, 

(iv) two measurable observables, one on S alone and one on A alone, characterized 
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by the respective projector-valued measures 

projectors {Pj^„ = ^ projector-valued measure defined on 3Cg ® 

Je^. That is, 


s n_ 

m 


m 



( 211 ) 


xni 



( 212 ) 


2 P 
mn 


mn 



® 


( 213 ) 


and such that 

(V) Qi = Tr^{r‘Au’'P„,nU} 


= Tr^{PA“^(nm® nn"“5 

for all i e ,/ and the corresponding m, n. / 


(214) 


DISCUSSION. Problem 2 is almost identical t' Problem 1 except in the necessary 
and sufficient condition; the set is defined for Problem 2,/ 

In the discussion of detection problems the eigenvalues of the observables merely 
serve as labels of the outcomes. But in estimation problems the cost functions also 
depend on the magnitude of the eigenvalues, and both Problems 1 ind 2 must be modified. 


PROBLEM la 

We assume by the extension technique described in Part I that we have already found 
an apparatus space th® density operator p^, and an observable B on JCg ® 3C^, 
which is our desired measurement. (If the original measurement is a generalized mea- 
surement, we assume that B is found to be its observable extension on JCg ® 2fC^.) Given 
a quantity C that we know how to measure on the apparatus, our problem is to see 
whether an interaction can be found such that after the interaction the measurement C 
gives the same statistics as the measurement B without the interaction. Again, the 
necessary and sufficient condition is for B and I^ ® C to be unitary equivalent. That 

s 

is, there exists a unitary operator U such that 

B s! U^djj, ® C)U. (215) 

s 


For two operators to be unitary equivalent, their spectra must be identical. This means 
that if {Ej^} and {Ej^} are their spectral measures, then 


= U^U, 


W \./ 


( 216 ) 
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fThe spectrum of an operator B is the set of all X £ C such that the operator (B-M) 
does not have an inverse. Identical implies that the spectral multiplicities (i. e. , 
the degree of degeneracy of each eigenvalue) must also be identical.] 

PROBLEM 2a 

This problem is similar to Problem la. If B is the abstract observable to be mea- 
sured, and C and D are the two measurable observables on S and A respectively, the 
problem is to find a unitary operator U such that 

B = U^(C ® D)U (217) 

and the conditions on the spectra are the same./ t 

We have now provided a summary of the required interaction by specifying the uni- 
tary transformation that results. Next, we shall show how this unitary transformation 
is related to the interaction Hamiltonian. From the stinicture of the interaction Hamil- 
tonian, we should know how to couple S and A to bring about the desired interaction. 
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XVIII. INTERACTION HAMILTONIAN 


18.1 Characterization of the Dynamics of Quantum Interactions 


When two systems S and A interact, the evolution in time of their joint state is given 
by an interaction Hamiltonian Hj, defined on the same tensor product Hilbert space 3Cg ® 
^ ^ on which the unperturbed Hamiltonian H^ = Hg (gi 

IjC ^3C 9 H^ acts. Hg and H^ 

A S 

are the Hamiltonians of S and A. To determine the dynamics of the interaction H^ is 
replaced by H = H^ + Hj in the Schrddinger equation for the joint state, 

ifi I s+a)> = h| s+a)). (218) 

The formal solution to this equation is 

|s* + a*>) = V(t-t^)is^° + a*°>), (219) 

where V(t-t|^) is a unitary operator defined as 

V(t-t^) = exp{--^ H(t-t^)}. (220) 


It can be verified that 


V(t) V(t') = V(t+t'), (221) 

and hence {v(t)} is a one-parameter unitary Abelian group. It can also be shown that 
V(t) is continuous in the weak topology (i.e. , (x| V(r)|y) is continuous for all t and all 
X, y S 3Cg 9 3C^). 

The dynamics of the interaction described by Eq- 219 is described in the Schrodinger 
or S-picture where the state of the system evolves with time. In another description, 
the Heisehberg or H-picture, the states remain constant in time but every observable A 
evolves as 


A(t) = U^(t) A(0) U(t). (222) 

The S-picture and the H-picture are completely equivalent and will be used interchange- 
ably. 

Sometimes when we wish to describe the sole effect of Hj, it is convenient to remove 
from the equation the time dependence associated with the free Hamiltonians Hg and H^. 
This is accomplished by a unitary transformation on the states, 


Sj + aj)> = exp{-^(Hg O Ijj^ +I 


•'fc ® 

3Cs 




s* + a‘>>. 


(223) 


where the subscript I denotes the change of state with time because of the interaction. 
This description is called the interaction, or Dirac, picture. Equation 218 becomes 
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(224) 


ifi-^ |sj + aj>) = Hj(t) • |s5 + aj>), 

where 

Hj(t) = exp{^(Hg ® Ijc + 1 ^ ® H.)t} • H 

A S ^ 

exp{“ ^ (Hg ® Ijg +Ijjg ® H^)t}. (2E5) 

AS 

The formal solution to the interaction problem, which is well known in time -dependent 
perturbation theory, is often used in scattering and quantum field theories: 



where 


U(t, t^) = T exp{--^ /J Hj(t') dt'}. 


(227) 


with U(t,t^) a unitary operator, and T the time-ordering operator. 

Equations 224, 226, and 227 can be combined to obtain the following differential equa- 
tion for the two-parameter unitary transformation U(t, s) 

^ U(t, s) = - j^Hj(t) U(t, s), (228) 

where 


(229) 


U(t, s) U(s, u) = U(t, u) 

U(t.t) = l, V^t. 

Hence {u(t, s)} is a two-parameter unitary group. In general, unlike the cvie-parameter 
unitary group V(t) in the S-picture, U(t, s) does not depend only on the time difference 
T = t-s, unless Hj commutes with H . In that case. H^(t) = H, for all t. and U(t, s) = 
H, (.-=)}. ‘ ' 

If the joint state of S+A is described by a density operator the time evolution 

r t . . , OX/V 

of IS given by 

Ps+A ~ ^^^*^*0^ Ps+A^^^*"^o^’ 

and the interaction picture 

p{ -u(t,t)pj° uVt„). 

^S+A ° ^S+A ° 


(230) 


(231) 


Heretofore we have considered conservative interactions where the Hamiltonian is 
constant in time- With a little modification of the relevant equations, nonconservative 
interactions can be characterized. Suppose that the interaction Hamiltonian Hj(t) is 
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time-variant, then the SchrOdinger equation that describes the evolution of states can 
be obtained from Eq. 218 by replacing the time-constant Hamiltonian with a time-variant 

one. 

ih^ |s+a)> = H(t)|s+a>), 

where Ui‘ t = + Hj(t). The solution is of the form of Eq. 226; 

t t 

|et + a*)) = W(t.y|s°+a >), 

where W(t.t^) = T ■ exp{--^ H(t') dt'}. In the interaction picture, W(t, t^) is replaced 

by 

(234) 


(233) 


Wi(t.to) - T • exp{-^ Hj(t') dt’}, 


where H,(t) = exp{^ • H^t} Hj(t) • exp{- ^ ' H^t}. 

Thus we can see that the effects of an interaction for a duration of time can always 

be characterized by a unitary transformation. We shall now see whether we can find 
the interaction Hamiltonian if we are given the unitary transformation. 

18.2 Inverse Problem for Finite Duration of Interaction 

We have attempted to specify the interactions required for the realization of quantum 
measurements. That specification is in the form of a unitary operator acting on the 
tensor product space 3Cg ® 3C^. It is very difficult, however, to make suggestions about 
the coupling between S and A that will bring about the interaction by looking at the uni- 
tary operator. We shall now try to find the interaction Hamiltonian (or Hamiltonians) 
that gives such a unitary transformation. This is the inverse of the problem of finding 
the unitary transformation from the interaction Hamiltonian. At first we shall consider 
only finite duration interactions. 

PROBLEM 3 (Schrbdinger Picture, Conservative Interactions) 

Suppose during the time interval from t^ to t^ that the resulting transformation on 
the joint state of S+A in the S-picture is given by the unitary operator U. The trans- 
formatior U deviates from that affected by the free Hamiltonian H^ because of the inter- 
action Hamiltonian Hj. We want to find Hj. 

SOLUTION AND DISCUSSION. We assume that from the time -« to t^ S+A is 
evolving according to the free Hamiltonian. The interaction Hamiltonian Hj is 'turned 
on’ at time t . and continues to affect the system S+A until t^. The turning on of the 
interaction presumably does not affect the states of S+A except in the way predicted by 
the Schrodinger equation. 
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The solution to this problem is well known. Since the one-parameter unitary group 
defined in Eq. 220 is continuous by a theorem of Stone^*^ (see Appendix Q), there exists 
a self-adjoint group generator H > 0 such that 

V(t) = exp{- • Ht} (235) 


and V(tj-t^) = U. In fact, H can be written as 

H = lim-^{U * ° -ij. 
t-0 


(236) 


Then the interaction Hamiltonian is given by 

Ht = H - H , 

I o 


(237) 


If the free Hamiltonian for the apparatus H^ is unknown, then 
s A 


(238) 


In general, there is no unique decomposition into Hj and I-j^ 

.additional assumption that Hj has finite trace (trace class), 
given by 



® H . . But if we make the 
S ^ 

then there is a unique H^ 


(239) 


where {| Sj)}f j is any orthonormal basis in the space 3Cg which we assume to be infinite 
dimensional. This results because Hj is of trace class; hence, (sj H^] s^> must vanish 
as i — 00 and leave 

= Um <3j| l™ I (240) 

Trace class interaction Hamiltonians are very important, since they form a large 

23 2fi 

class wherein time-dependent and time-independent perturbation theories converge. ’ 


PROBLEM 4 (Interaction Picture, Conservative Interactions) 

If we are given the resulting unitary transformation U in the interaction picture, 
there is no known guaranteed procedure for finding Hj directly. If H^ is known, we can 
transform the problem into a problem in the S-picture by specifying the unitary trans- 
formation as 


U' = exp{-^H^(tf-y} U, 


(241) 


and make use of the solution of Problem 3. There is a method for working directly 


74 


within the interaction picture that will probably yield a time constant Hj. but that is a 
particular case of the general nonconservative interaction problem which will be dis- 
cussed next./ 

We shall work entirely iu the interaction picture for nonconservative Interactions. 
The mathc matics in the S -picture is similar, and the only requirement is to put the cor- 
rect quantities in tiiis problem. 


PROBLEM 5 {Nonconservative Interactions) 

Given a unitary operator U which summarizes the effect of a nonconservative inter- 
action between S and A in the interaction picture, we want to find an interaction Hamil- 
tonian (or a class of interaction Hamiltonians), which can be time-variant so that it will 
give the transformation U in the interval from zero to T. / 


SOLUTION AND DISCUSSION. By the Spectral theorem (Theorem 1 , see Appen- 
dix B), there exists an L^-space of ftinctions defined on a domain X with the mea- 
sure M., such that L^(X, tx) is isometric to the space 3Cg 9 3C^, and y : U — exp{if(x)} 
where f(x) is a real- valued function defined on X, and is the isometric mapping. Let 
g(t) be any square -integrable function in the interval (0, T). Let 




V‘> ■ 


! g(t) 1 ^ dt 


l|g(t)ll 


for 0 ^ t T 


(24E) 


where lig(t)|^ = |g(t)i^ dt. 


otherwise 

Then 


hg(t) = 0, t«0 

hg(t)=l, t^T. 

Let 

u (X, t) = exp{if(x) h (t)}. 

§ CD 


(243) 


(244) 


Then 


u (x, 0) = 1 
§ 

Ug(x, T) = exp {if (x)}. 

If is the inverse map from the L^«space onto 3Cg 9 3C^, ^ 
is unitary, with 


"g<*> = 



t «0 
t > T. 


(245) 

; u (x. t) -> U (t) which 

(246) 
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The interaction Hamiltonian in the interaction picture is simply 

: {f(x) hg(t)} = Hf(t) (247) 

which satisfies Eq. 234. And in the S-picture 

H®(t) = exp{^H^t} Hf{t) • exp{- ^ H^t}. (248) 

In general H^(t) will not be constant in time. If it u then it is a solution of Problem 4./ 
Note that the upper time limit T can be «. 


PROBLEM 6 (Impulsive Interaction) 
Let 

Hj(t) = 6(t) Hj. 


Then 


Hj(t) = 6(t) exp{^Hpt} Hj exp H^t} 

= 6{t) Hj. 

The unitary transformation occurring around t = 0 is 
'l t = 0^ 

U(t) = - 

U = exp{-j^Hj} t = 0^. 

If we are given U, Hj can be found by Eq. 236; 

Hj = liml{u*-l}. 
t-0 


(249) 


(250) 


(251) 


(252) 


18.3 Inverse Problem for Infinite Duration of Interaction 

Sometimes it is very difficult to turn on an interaction at some timet = t without 

o 

affecting the state of the system. In such a situation it is desirable to provide the 
coupling for the interaction long before the information- carrying part of the system 
arrivesi so that interaction starts gently but essentially goes on from the period of time 
-00 < t ^0. The final measurement is made at time t=C. The resulting transformation 
in the interaction picture for the duration (-oo, 0) by Eq. 227 is 

U(0j— 00) = lim U(Ojt). 12531 

t— 00 ' ’ 

If ! x) is the state of S+A at t= 0, exp{--^Ht|| x) is its state at an arbitrary time t. After 
removing the dependence on the free Hamiltonian the state in the interaction picture is 
jij" x)* Iti the infinite past, S+A is then in the state 
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(254) 


or 


U_«,)= lim exp{^H^t} • exp{- 
t— 00 


|x>= lim exp{^Ht} exp{-^H t}|x_^) 
t-»-oo 




(255) 


The limit Q exists only for certain conditions on and Hj (for detailed discussion, 
see refs. 23, 26-28). That issue is not important here, since we are interested in the 
inverse problem, where is given. 

If the limit 


S2 = lim exp{^ Ht} exp{- ^ H^t} 


t— 00 

exists, it is in general an isometric ope '■ator and it satisfies the condition 

Hn = RH . 
o 


This can be shown as follows; 

~ (e^^^ e ^^^°) = ie^^^ (H-H^) e 


-itH 


(256) 


(257) 


(258) 


If the limit exists, the derivative in Eq. 258 as t — -« is zero, which implies as t — -«» 

(259) 


. „ -itH 

eitH j g o _ Q 

o 


or 


or 


itH „ itH „ 

e He = e H^e 


-itH„ -itH„ 

TT„ltH O itH O rr 

He e = e e H . 

o 


Therefore as t -» we have HS2 = i2H 


(260) 


(261) 


In the inverse problem is given as the transformation brought about by the inter- 
action, and i2 carries states in the infinite past to states at t = 0 in one-to-one fashion, 
and hence the inverse map can be found. Thus 


H = H n 
o 


-1 


(262) 


or 


H = «H - H . 
I o o 


(26 3) 
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XIX. CONSTRAINTS OF PHYSICAL LAWS ON THE FORM OF 
THE INTERACTION HAMILTONIAN 


We have described several methods of finding the interaction Hamiltonian from a 
given unitary transformation. Not every interaction Hamiltonian corresponds to a real- 
izable interaction. We can narrc the classes of Hamiltonians that have to be considered 
by studying constraints imposed by different physical laws. For example, in a collision 
type of interaction an interaction Hamiltonian that does not conserve linear momentum 
is clearly not admissible. 

19.1 Conservation of Energy 

We consider first the constraints of the law of conservation of energy on the inter- 
29 

action Hamiltonian. 

Assume at some initial time t = 0 that the system S and the apparatus A are not 
interacting and evolve according to their free Hamiltonian H^. If | s + a )) is the joint 
state at this time, the energy of the system at this point is 


'S+A 


// o , o 
= « s + a 


H 


o 0\\ 

s +a ». 


(264) 


After some initial contact time, say > 0. the systems interact, and the joint state 
evolves according to the full Hamiltonian H = H^ + Hj. For any t > t^ 


a*' + s 


‘» = u. 


O , O \v 

a + s )) 


(265) 


where 

= exp{--^ Ht}. (266) 

The energy of the combined system S+A at time t > t^ is 

4+A = « + + 

= «s° + a°jU^HU^|a* + s*». (26?) 

Since H is the generator of the unitary group U^, it commutes with the combined system. 
Hence 

= « s° + a°| H^l a° + s°» + « s'" + a° | HjJ a° + s° » 

= El^^+{{s° + a°\Bj\a° + s°}). (268) 
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Conservation of energy requires 


^S+A ■ ^S+A' 




Hence this implies 


( 269 ) 


«s° + a°|Hj|a°+s°)> = 0. (270) 

If we allow the joint system S+A to have any state in 3Cg 0 3C^, the fact that Hj has 
to be a self-adjoint operator, together with Eq. 270, implies Hj = 0 identically. This 
means that if energy has to be conserved, no nontrivial interaction may occur. 

There are several ways to impose conditions on Hj so that Eq. 270 will be satisfied. 

Condition 1 

(a) Restrict the interaction to be a ' local' interaction. That is, the interaction takes 
place appreciably only when the physical distance of S and A is within certain bounda- 
ries. 

(b) At time t = 0 before any interaction takes place require that the allowable states 

of S+A be within a linear subspace ® ^A’ in sr e sense does not 

fall within the boundaries of the interaction. For a state | s° + a°)) in ^ s+A’ means 
that 


«a° + s°|Hj]s° + a°» = 0. (271) 

In this case the interaction will finally take place at some time t = t^, since S+A will 
evolve according to the free Hamiltonian which eventually carries them into the region 
of interaction. It is clear that cannot be an invariant subspace of H^. Otherwise 

the action of H^ could never carry any state in outside it. Hence the condition 

for nontrivial interaction to take place is 

[H . P .. ]* 0 , (272) 


where P is the projection operator into the subspace ^ g+A’ gives a 

S+A 


description of the process. At t = 0, | a + s )) s S+A" 

«s° + a“|P . |a° + s°» = l. 

*^S+A 

At t = t > t = ' contact' time, 

I a^ + s^» = exp {- H^t} • | a° + s°» 


(273) 


Vja°+s°» 


(274) 


y 
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B 





Probability currant 
due to Hn 


Figure 14. Condition for nontrivial interaction. 


The probability that S+A will be fotind in the subspace at time t is 

Pr {S+A in ^ = <( sS a* | I a^ + s* » 

StA 

V,la°+s°». (275) 

OtA 

Therefore the 'probability current' that crosses the boundary BB' is 
- Ie {Pr{S+A in = -If 

If [H , P ^ ] = 0, no probability current goes into *^g+^ where the interaction takes 

° *^S+A 

place. 

Note that Hj = 0 in v^g+^. Hence and are invariant subspaces of Hj 

but not of H^. Therefore, for nontrivial interaction to occur, 

(277) 


Condition 2 

If we are willing to consider a time-variant Hamiltonian, we can have an interaction 
Hamiltonian Hj(t) such that 

f= 0 t = 0 

H.{t) < 

I Uo t> 0. (278) 

The energy = {( s* + a^ | H^ + H^(t) j a^ + s* )) will not be constant in general, and 

energy is either pumped in or out of the combined system S+A. 
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Condition 3 


In discussions of scattering ' adiabatic switching' is encountered. The interaction 
Hamiltonian is assumed to have the form 


Hj(t) s Hj. 


(279) 


Hence interactions start at some time t -* 0. There is no interaction as t -oo. But 

as t approaches t = ^ , the interaction becomes appreciable. Then the system S+A 

1 

is assumed to be observed at large times (at t -»■ =o). Sy passing to the limit as £ 0, 

we can get a consev-i'ative interaction result and it can be shown that the energy of the 
system at t = -w is equal to the energy at t = +=0. There are subtle problems involved 
in this view, and we shall not discuss it further. 


19. 2 Conservation of an Arbitrary Quantity 

Suppose there are two quantities, characterized by the self-adjoint operators Qg of 
the system S and Qa of the system A, the sum of which is conserved during and after 
an interaction. If [ a + s )) is the state of S+A at time t, this means that the quaintity 


«sUa*lQ|a* + s^»H(Q>^ 
is conserved, where 

-^S ® ^3Cg ®'^A* 

If I a°+ s°)) is the state at t = 0 when no interaction takes place, 
< Q>^ = « s° + a° I v/qv^ 1 a° + s°». 


(280) 


(281) 


(282) 


where is given by Eq. 220. The conservation law for the quantity Q states that (Q)|. 
is constant in time. That is, 

^ < Q)t = « s° + a° 1 1 (Vt^Q V^) 1 a° + s°» = 0 

= «s"’ + a°|V^^{^[H,Q]}V^|a® + s°» 

= «s* + a*|^[H,Q]|aUs*». (283) 

If we allow the state of S+A to be any state in 3Cg ® 3C^, then a necessary and sufficient 
condition for the quantity Q to be conserved is 

[H,Q] = 0. (284) 
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Since in the absence of interactions the quantities Qg and are individnaily conserved, 

[Hg,Qsl-0 

which implies 


Hence, together with Eq. 284, we have 


= 0 . 


(286) 


If are the eigenspaces (invariant subspaces} of Q, the Hamiltonians can be 

written in the form 


M 


H 


i=l 





(287) 


19. 3 Constraints of Superselection Rules 

When the system imder observation admits certain symmetry, not all self-adjoint 
operators are measurable, even in principle. For example, if the system admits a 
rotation symmetry, say around the z axis, then by the definition of symmetry the system 
is indistinguishable from a rotated version of the same system. This implies that no 
measurable quantity can be changed by this rotation. The rotaHonal group around the 

z axis is represented by the unitary transformation U(0) = e where is the 

z-component angular momentum, £ind 0 is the rotated angle. If A is any measurable 
quantity, it will not be affected by this rotation. That is. 



A 



(288) 


which implies 

[J^,A] = 0. 


( 289 ) 


Hence, all measurable quantities must commute with the ' super selection' operator 
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In an arbitrary quantum system, any superselection rule can be represented by a 

superselection operator B such as J , and every measurable quantity must commute 

z 

with it. If we take the von Neumann view of measurable quantities, as long as the bases 
operators of the algebra commute with B, all measurable quantities commute with B. 
When there is more than one superselection rule with superselection operators {b.}^^^, 
a first requirement is for the B^ to pairwise commute, and every measurable quantity 
must commute with each of them. In fact, we can find a maximal supers elect ion oper- 
ator B that contains all the eigenspaces of the B^^, so that any operator commuting with 
B commutes with all B.. Hence we need only consider one superselection operator at a 
time. 

When there is a supers election rule, the density operator that represents the state 
of a system is not always unique. Let be the resolution of the identity of the 

maximal superselection operator B. If A is the measurable quantity to be measured 

+Vi 

on the system with the density operator p, the m moment of the outcome statistics is 
given by Tr {A^p}. But 

[a", B] = 0, all n. (290) 


Therefore 

a"= S Pk^ Pk 
k=l 

and 

„ K 

Tr {a"p} = Tr {( S P a"p. )p} 
k=l ^ 

K 

= Z Tr {Pj^a"p. p}. 
k=l ^ ^ 

Using the identity Tr {AB} = Tr {BA}, we have 
1 ^ 

Tr{A"p}= 2 Tr{A''p,pPj 
k=l ^ ^ 

= Tr{A" S P.pPj 
k=l ^ ^ 


(291) 


(292) 


= Tr {a”^}. 


(293) 


In general 


K 

P= S P.pP, ^P. (294) 

k=l ^ 
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Since both the density operator p and any observable A have to commute with a 
superselection operator B, it is necessary that the unitary transformation U that sum- 
marizes the interaction commute with B. 
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XX. CONCLUSION 


We have made suggestions for implementing abstractly characterized measure- 
ments, by considering the possibility of activating an interaction between the information- 
carrying system and an apparatus, so that when an implementable measurement is 
performed afterward on the composite system the outcome statistics will be the same 
as that in the abstractly characterized measurement. Procedures for finding the 
required interaction Hamiltonians are described. The Hamiltonian is expressed as a 
mathematical function of parameters of the system and the apparatus. Although we do 
not specify exactly how to perform a certain measurement experimentally, this 
procedure provides clues to finding the relevant quantities that should be actively 
involved in the experiment. It is hoped that by observing the form of the interaction 
Hamiltonian, experimentalists will be able to relate abstract measurements to those 
that they can implement experimentally. 


APPENDIX A 


Statement of a Theorem for the Orthogonal Family of Projections 

30 

The following theorem is due to Halmos. 

Theorem 

If P is an operator and if {p.} is a family of projections such that S P. = P, then a 

J J 

necessary and sufficient condition that P be a projection is that Pj i Pj^ whenever j ^ k, 

or, in different language, that{p^} be an orthogonal family of projections. If this con- 

dition is satisfied and if. for each j, the range of P^ is the subspace Ji ^ then the 

range of P is V ^ i * 

j ^ 

30 

The proof has been given by Halmos. 


86 


APPENDIX B 


Spectral Theorems 

We shall state two spectral theorems. The first is due to Riesz and Sz.-Nagy; 

23 

the second is due to Dunford and Schwartz. 

SPECTRAL THEOREM (Riesz and Sz.-Nagy) 

Every self-adjoint transformation A has the representation 

A = /I 

where {Ej^} is a spectral family that is uniquely determined by the transformation A; Ej^ 
commutes with A, as well as with all of the bounded transformations that commute 

with A./ 

SPECT RAL THEOREM (Dunford and Schwartz) 

For every self-adjoint operator A, there exists a measure space (Jl, fi) and an iso- 

9 

metry y of SC into L (fi, such that 
y i A = 

where f is a measurable real— valued function on and m^ is multiplication by f. / 
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APPENDIX C 

Proof of Natmark's Theorem 


Theorem 2 (NaYmark's Theorem) 

Let be an arbitrary resolution of the identity for the space H. Then tliere exists 
a Hilbert space h'*' which contains H as a subspace and there exists an orthogonal reso- 
lution of the identity for the space such that 

for each f e H where P"*^ is the operator of projection on H./ 

a 

Proof (Akhiezer and Glazman ); Consider the set (R of all pairs p of the form 

where A is an arbitrary real interval and f is an arbitrary vector of H, On <R we define 
a function ®(Pj, Pg) such that if p^ = {A^, f^} and p^ = {a^, ^ 2 }’ 

®(Pl*P2> = <i^A^-A/l-V- 


We show that the function $(pj, P 2 > is positive-definite. Indeed, 


'(P,. Pj) =■ fjl ' Ifp ''a, • 4 / 2 I = If'A, • a/2’ 'i> = *<P2’ Pl> 


and, on the other hand, 


s $(P:. Pk> = 2: (F^ . ^ f , y eiij.. 

i, k=l I k I k 1 K 

If the intervals A^(i=l, 2, . . . , n) are pairwise disjoint, then 

. (^a.-a/p V Mk 

i, k=l 1 k 1=1 1 


(C. 1) 


(C. 2) 


If the intervals A. (i=l, 2, . . . , n) are pairwise disjoint ar J the intervals * auJ coin- 
cide, then the sums in the right member of (C. 1) fall into two parts. Jne part, with 
indices from 3 to n, is of the form (C. 2), and the other part, with indicp'^ 1 and 2, 
satirt'ies 


£ iM i 0 - 


'• A..A/P V '^A,VV '^A, J, 6* 6A 


I 
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i‘4^r*iTwr at 1 , 


The case with arbitrary intervals A. (i=l, 2 n) can be reduced, with the aid of 

additional partitions, to the cases already considered. Hence, if fl = 0, then 

Thus *(pj, Pg) is a positive- definite function on (fl. 

Using the method described earlier we imbed (R in a Hilbert space H^. 

Not desiring to introduce new notations for those elements SB of the space H^ which 
are subsets of (R by the construction described earlier, we agree on the following: if 
an element p of (R belongs to ^ then we write p instead of S! , 

We indicate the scalar product in the space H^ by an inferior index ( , ), and have 

' + 

^Pj> P2^+ ~ ^^Pl* ^2^* 

We now consider elements of H"^ of the form {l, f}, I = [-«, oo]. By means of the 
equation 

({l.f}.{l.g})+= (Fjfg) = 

we can identify the pair {I, f} with the element f from H. The element 2 fj^} of 


the space II is identified with the element 2 | f of the space H. Thus, H can be con- 

+ k=l 

sidered as a subspace of the space II . 

We now solve the following problem: find the projection of the element {a, f} of the 
space H on the subspace H. We denote the projection to be found by {l, g}. For each 
h of H, 


(K h}). = 0, 


({a, f}, {l. h})^ - ({l, g}, {I, h})^ = (F^f, h) - (g, h) = (F^f-g, h) = 0, 


so that 


g = Fa1‘. 


f} = {I. Fa^}. 


(C.3) 


The theorem will be proved if it is established that the operator function E^, which 


is defined by 


Et{A’.f} = {-^nAsf} 


(C.4) 
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for each element of the form {A' , f} e is an orthogonal resolution of the identity for 
the space Ir , since then (C. 3) can be expressed in the form 

P+E^f = P'^E^ {l. f} = P'^{A n 1, f} = P+{A, f} = {l, F^f} = F^f 

for each f G H. 

* 1 * * 

It is evident that is an additive operator function of an interval. Furthermore, 
the two equations 

(E+ {A- . f} = {A n A' . f} = {A n A n A' , f} = E^{A' , f}, 

and 

= n A'.f},{A«,g})^= (F^.^,.^„f,g) = (F^,.^.^„f,g) 

= ({^'.4 Ei{Ar., g})^. 

imply that E^ is a projection operator. Finally, it is evident that Ej{A' , f} = {A', f}. 

Since the family of all elements of the form {A', f} is dense in the extension 
+ + 

to H by continuity of the operator E^ defined by formula (C. 4) is an orthogonal 
resolution of the identity for the space H . The theorem is proved. 
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APPENDIX D 


Proof of Theorem 3 

For the statement of Theorem 3 see Section IV. The proof given here is adapted 

1 3 

from Sz.-Nagy and Foias. 

Proof; 

(a) T(e) = Pj(j U (e)/5C = = Ij^.. 

T(s"S = Pjc U = (Pjc U (s)/3C)^‘ = T(s)’‘. 

We have 

S S {Ptp U (t”^s) h(s). h(t)} = 2 2 {U(t) U(s) h(s). h(t)}- 

seG t£G sSG tea 

= II 2 U(s)h(s)l! ^ ^ 0. 

S£G 

(b) Let us consider the set obviously linear, of the finitely nonzero functions 
h<s) from G to 3C, and let us define on 3C’*‘ a bilinear form; 

(h,h'> = 2 2 {T(r^s)h(s).h'(t)) ^ 0 . 


where 

h = h(s). h' = h'(s). 

By using the Schwarz inequality, 

l<h,h'>^ « <h.h> • <h'.h’)- 

so that the h for which <h. h> = 0 form a linear manifold ./K in 3C+. It follows that the 
value of (h»h') does not change if we replace the functions h, h' with equivalent values 
modulo In other words, the form (h.h') defines in a natural way a bilinear form 

(k, k') on the quotient space 3C+ = 3C+/^ . Since the corresponding quadratic form (k, k) 

is positive-definite on 3C+, || k|| = (k, k)^/^ will be a no^rm on 3C+. Thus by completing 3C+ 
with respect to this norm we obtain a Hilbert space 3C . 

Now we embed 3C in 3C+ (and even in 3C+) by id^.mtifying the element h of 3C with the 
function h = 6^(s) h where 6^{e) = 1 and 6^(3) = 0 for s # e or, more precisely, with 
the equivalence class modulo determined by this function. This identification 
is allowed because it preserves the linear and metric structure of 3C. Indeed, we 

have 
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) 


( 6gh, 6^h' ) = S S (T(t"^s) 6g(s) h, 6g(t) h') 3 g 
s t 

= (T(e)h,h') 3 (. 

= (h.h')jg. 

A I A A A A A 

For h = h(s) £ 3C and a e G» we set h = h(a s). Obviously we have (h+h') = h +h' « 
aaaaaa a aaa 

(ch)g^ = ch^, hg = h, (h^)g^ = Furthermore. 

,h' ) = S S (T{r^s)h{a"^s),h'(a"^)) 

^ ^ St 

= S S (T{T"^<r)h((r),h'{T)) 

(T T 
A A 

= (h. h'). 

A A A A A , 

Therefore h E. ^ implies h S and consequently the transformation h h in 3C 

generates a transformation k — of the equivalence classes modulo ^ . Setting U(a) k = 

k , for every a £ G we define a linear transformation of on 3C* such that U(e) = I, 
a •' o o 4. 

U(a) U(b) = U(ab), and (U(a) k. U(a) k') = (k.k')* These transformations on 3C , form a 

representation of the group G. 

Setting 6 (s) = 6 (a ^s). for h, h' £ 3C, we obtain 

cL 0 

(Tt(a) h, h’)jg = ( 5gh' ) 

= s i: (T(t"^s)& (s)h,6 (t)h')j(, 
s t 

= (T(a) hj h’)»jg. 

Hence T{a) = Pr U(a). for every a £ G. 

A ^4' 

Let us observe that every function h = h(s) e 3C can be considered as a finite sum 
of terms of the type 6^{s) h; i. e. . the type (6 (s) h) for o- e G. Hence every element k 

^ V 0 

of can be decomposed into a finite sum of terms of the type U(<r) h for <r £ G, h £ 3C. 

This implies = V U(s) 3C. 

s£G 

The isomorphism of the unitary representations of G satisfying T{s) = Pg^U(s)/3C, 

)rf\3 €. G and SC*” = V U(s) 3C is a consequence of the relation 
s£G 

(U(s)h, U{t)h'} = (U(t/’'u(s)h,h') 

= (U(rSu(s)h.h') 

= {Ufr^s)h,h') 

= (T(t“^s)h,h’), 

which shows that the scalar products of the elements of SC*" of the form U(s) h, U(t) h'. 


for Si t £ Gi hih* G 3C, do not depend upon the particular choice of the unitary represen- 
tation U(s) satisfying our conditions. 

The case when G has a topology and T(s) is a weakly continuous function of s remains 
to be considered. Let us show that then U(s) is also a weakly continuous function of s; 
i.e. I the scalar-valued function (U(s)ki k') is a continuous function of Si for any fixed 
k, k' e 3C Since U(s) has a bound independent of s (in fact, |ju(s)[| = 1) and, moreover, 
the linear combinations of the functions of the form 6^h for o- e G, h G JC (more exactly, 
the corresponding equivalence classes modulo are dense in 3C^, we conclude that 
it suffices to prove that (U(s) 6^h, 6^h’) is a continuous function of s for any fixed 
h, h' G 3C and o-, t G G. This scalar product is equal to 

(U(s)U(o-)h, U(T)h') = (U(T"^so-)h,h’) 

= (T(T"^s(r)h,h'), 

and this is a continuous function of s because T(s) is assumed to be a weakly continuous 
function of s. 

Thus Theorem 3 is proved. / 
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APPENDIX E 


Proof of Theorem 4 


Theorem 4 

Let be an operator-valued measure on the interval 0 « X, ^ 2tt, then there exists 
a projector -valued {Ej^}in some extended space 3C^ C 3C sucii that = PjgEj^/3C for 
all / 

Proof ; The integral 

T(n) = /q e“*^ dFj^ , n = 0,±l,... 


exists and defines an operator function T(n) on the Abelian integer group Z, such that 
T(0) = I. T(-n) = T{n)^ and 

S E (T(n-m)h . h ) = S S d(F,h ,h ) 

nm "”^°nm '\n*m' 

= S E (F(dMh ,h ) 

° n m " ^ 

= {F(d\) S e“^ h . E e^"^ h ) > 0. 

n m ^ 

where the last integral denotes the limit of the sums 

/ in\. inX. . 

2((F(H+,l-FV)Se “h„,Ee “h^). 


with \^ = 0 < < . . . < < . . . < and max "*■ 

Hence, by part (b) of Theorem 3, there exists a unitary operator U = 
on an extended space 3C^ C 3C such that 

T(u) = PjgUtnj/K , n = 0, ±1, . . . 


^iX, 


dE^ 


i. e. , e^"^ d(F^h, h' ) = e""^ d(E^h, h' ). 


h, h' e 3C 


and E^ is a projector-valued measure, and it can be chosen so that it satisfies the same 
condition of normalization as L e. , = ^\+0’ ®o ” ^2v-0 “ ^30' 

equation implies Fj^ = Pj^E^^/K. / 
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APPENDIX F 


Proof of Theorem 5 


Theorem 5 

M 

For an arbitrary operator- valued measure ^ Qj = I* whose index set has 

a finite cardinality M, the dimensionality of the minimal extended Hilbert space 
min 3C'*‘, is less than or equal to M times the dimensionality of the Hilbert space 3C. 
That is, 

dim min {KC'*'} < M dim {3C}. / 

Proof: The minimality condition of Theorem 3 is 

min 3C‘‘‘ = V U{n) 3C, 
n=0 

where 

U(n) = ej”’^ dE^, 

with j = and {e^J is a resolution of the identity. For a finite set of the Q. the inte- 
gral becomes the sum 

M jnX, 

U(n) = S e ^ Q.. 
i=l 


where the are M distinct real numbers chosen arbitrarily. 
Let 


N _ 2iri 
^i " M 


i = 1 , . , , , M. 


Then 


M 


U(n) = 2 exp{j ^ i} 

j=l 


U(M) = U(0) = Igg. 


27T{M+£) 

U(M+£) = S exp{j jjj i} Q. 
i 


= Z exp{ j ^ i} Q. 


1 

mn). 


Hence, with this choice of the the unitary group U(n) repeats itself every 


■ ' ■V--5-MS. -su-iUJaa 


,v -.iscu '-t-sssa 'Ja 
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M increments on the Index n, and the minimality condition becomes 


min 3C‘‘‘ = V U(n) 3C 
n=0 


M-1 
= { V 
n=0 


U(n) JJC} V 


^2M-1 
{ V 
n=M 


U(n) 3C} V . , . 


,M-1 , M-1 

= { V U(n) 3C} V { V 
n=0 n=0 


U(n) 3fC} V . . . 


M-1 

= V U(n) 3C. 
n=0 

Since U(n) is a unitary operator, each of the spaces = U(n) JJC, n = 0,l,..., M-1, 
has dimensionality equal to dim {3C}. (Note = 3C. ) For n ^ m, any two of these 
spaces may not be orthogonal. But if we assume that they are indeed orthog- 

onal, we can arrive at a union bound for dim {min 3C^}, 

r +1 1 

dim {min = dim { V U(n) 5C} 

n=0 


M-1 

= dim { V 


n=0 



M-I 

< S dim {if } 
n=0 " 


= M dim{3C}./ 


APPENDIX G 


Proof of Theorem 6 


Theorem 6 

If the operator-valued measure has the property that every is propor- 

tional to a corresponding projection operator that projects into a one - dimensional sub- 
space of 5C {i . e . , Qa = % kfl )<% 1 ' where 1 ^ 3^ 0, and | > is a vector with unit 

norm)f then the minimal extended space has dimensionality equal to the cardinality of 
the index A (card{A}). That is. 

dim {min 3C^} = card {a}. / 

Proof ; Let the projector-valued measure he the minimal extension of the 

operator-valued measure hie minimal extended space min 3C'*’ such that 


and 


S 

aSA 


n. 



Each projector 11^ projects into a subspace of min 3C*^. We shall show that if 
min 3C^ is minimal. is a one -dimensional subspace. jr 

Assume is not a one -dimensional subspace for some a. Let ^ complete 

orthonormal basis for this S so that K is an integer bigger than one. since S by 

an 

assumption is multidimensional. Then 
k=l 

Let 

where the vectors | g^) are no longer orthogonal nor do tiiey have unit norms in general. 
Hence 

= S |g®> (g“l 

k=l ^ 

= qjq„)<qj- 
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Each of the vectors Ig^^) must be proportional to otherwise it can be seen that 

Q is a nonzero operator over more than one dimensioni simply by orthogonalizing the 

set expressing in these coordinates. 

Hence we have =2^1%) where is a complex number, an ! 

= 141^ \%><%\ 


which implies 


K. 


q„= S Ukl • 

“ k=i 


Now let 


=”4*' O 


141 


a I Z 


1 . 


and 



Therefore 

=Qa- 

Since | h^) is a linear combination of vectors in S^. | h^ ) <h^ [ is also an exten- 

sion of orthogonal to other 11^,, a' * a. Furthermore, 11^ projects into a one- 
dimensional subspace, which means that the operator-valued measure with replaced 
by n , is an extension of the operator -valued measure Q and has an extended space with 
a smaller dimensionality than min 3C , which by assumption is the minimal extended 
space. Hence we have arrived at a contradiction. Therefore, for the minimal extension 
space, every projector-valued measure projects into a one -dimensional subspace S^. 
Since 

S n = I . 

a£A ** min 
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min JC*" = U 
aGA 


Therefore 


dim {min 3C’*'} = E dim {s } 


aSA 

= S K 
a£A ® 


= S = 1 
aSA 

= card {a}. 


I for a ^ a 
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APPENDIX H 


Proofs ci Theorem 7 


Theorem 7 

Given an operator-valued measure (RIQ^} denote the range space of 

{q^}, o e a, then 

dim {min = "E dim {(R{Q_}}. / 
aeA 

Proof ; We shall prove 

(i) dim {min S dim{tfl{Q }}. 

aG.A 

Then we shall show 

(ii) dim {min 5 s dim{R{Q }}. 

aGA 

so that the two quantities on each side must be equal. 

(i) Since each Q is a nonnegative -definite self-adjoint operator, there exists for 
' ^ C 

each an orthogonal set of vectors {|*lk^}j{=l> such that is diagonalized by these 
vectors, and where is an integer larger than zero. That is, 

«<, = J', <%!• 


and 1 ^ q^ S' 0. 

K 

The set of vectors {|l^)}(j=j^ cSA fact, we have 

I = S 


oG A 


K. 




K 


Therefore the set of one -dimensional operators, {P^ = ^*^kf^k=l, e£A ^ gener- 

alized resolution of the identity in 3C, and each is proportional to a one-dimensional 

projector. It is clear that an extension for the sd {^jjl^k=l, aGA a^so an extension 

for {Q } A. since each Q can be obtained by summing over K of the operators in the 
■^^a^cGA a . » ^ i * • 

former set. Bui. by Theorem 6 we know the dimensionality of the minimal extension 

space for the set of one -dimension operators {P^j^“j^ and that it is equal to the 
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cardinality of the index set 


K ^ 

dim {min 5C"*} for {pJJ,.®, „cA “ ^ 1 

tflc-l.aCA 


= r K . 
ceA 


But is the number of dimensions over which is nonzero. That is, K is the 
dimensionality of the rauige space of Q^, 

K = dim {(ft{Q >}. 


K 

Since an extension for the resolution of the identity Is also an extension 

for the resolution of the Identity {Q } ^ . 

e^oSA, It IS clear that the dimensionality of the 

minimal extended space for the is upper bounded by the dimensionality of the min- 
imal extended space for the , Hence 

dim {min ifC"*} for {Q } 


a^osA 


s dim {min tor {P^k“,,„eA 

= 2 K 
cSA ® 

= 2 dim {(R{Q }}. 
cSA ® 

We have proved (i). 

(ii) Now we wish to prove 


dim {min SC"*} 5 2 dim {tfl{Q }}. 

EGA ® 

Let the projector-valued measure be the minimal extension of the operator- 

valued measure on the extended space min 3C^ such that 


^0 ~ 


eSA minJfC^ 


Since the projectors 11^ are all orthogonal to each other (for the proof see Riesz 
and Sz.-Nagy ), the minimal extended space is simply the union of al’ subspaces into 
which the projectors 11^ project. Hence the dimensionality of min ,'fC^ is 


dim {..lin 


3C'^= 2 dim {«{n J} 

flSA ^ 
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Let us assume that 

dim {min 3C"*} < Z dim {(R{Q_}}. 

aSA ® 

Then there exists an a such that 

dim {tfl{n^}} < dim {(R{Q^}} = dim ^ dim 

which Is a contradiction. Therefore the inequality (ii) is true. Putting (i) and (ii) 

together, we have proved that dim {min SC"*} ~ Z dim {(R{Q^}}. / 

aSA ® 

In this proof It is assiimed that every has a complete set of eigenvectors. Strictly 
speaking, in an infinite dimensional Hilbert space only compact operators are guaran- 
teed to have a set of complete eigenvectors. Although there are cases when this assump- 
tion is incorrect, it does provide a heuristic proof of correct results. We shall give 
an alternative proof that does not depend on this assumption but leads to the same con- 
clusion. 

Alternative Proof of Theorem 7 : For each a S A we have 

where 11^ is a projection operator. 

Assume for the minimal extension 
dim {(R{Qjj}} < dim {(R{II^}} 
for some a £ A, We have 

= ^(R{Q„}^aP(R{Qj 
' ^(ft{Qj^3C^a^ifC^(fl{Qj 
“ ^3C^(H{Qj^a^(fl{Qj^(fC- 

Let Sjj be the closure of the range of 11^ when restricted to (R{Q^}. 

Then dim{s } « dim {®{qJ'} <dim {(fl{n 1} and S C (R{n } is the range space of n„ 
which implies 
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Hence 


I 
» 

“ ^K^(R{Qj^S^”a^S^^(R{Qj^3fC 

“ *’3C^(R{qJ*’s^^(R{qJ^3C 

" ^(R{Qg}^3C^Sg^3C^<R{Qj 

■* “ 

JV. ov 

C 

Therefore Pg is a projection operator and, together with the other II^,, a» # a is a 
projector -valued extension of the operator-valued measure But by assumption 

dim {<R{Pc }} = dim {s I < dim {tfl{n I}. 

O tt u 

Q 

Hence the set{ll^}^^^is not a minimal extension. And for a minimal extension, we 
must have 

dim {(fi{Qg}} ^ dim {«fi{n^}}, e A. 

It is easy to show that 

dim{(fi{Q^}} ^ dim {(R{n^}}, Va S A. 

I So for the minimal extension we have the equality 

r 

dim {(R{Q^}} = dim {(R{i1^}} 

and 

dim{min!IC^= £ dim{(R{n}} 
aSA “ 

a 2 dim{(R{Q}}./ 
aSA “ 


( 

I 
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APPENDIX I 


Proof of Corollary 3 

COROLLARY 3. The construction of the projector -valued measure and the extended 
space provided by Natmark's theorem (Theorem 2) is always the minimal extension./ 

Proof ; The proof of NaTmark's theorem in Appendix C is a proof by constructior.. 
That iSi a construction for the projector-valued measure is actually given for any 
arbitrary operator- valued measure {Q^I* We titall show that the resulting extended 
space in this construction is indeed minimal. First, we sketch another proof of Theo- 
rem 5 using NaTmark's theorem. 

In NaTmark's theorem the extended Hilbert space is spanned by the set of pairs 
{p a (A, f) for all subintervals A in the interval I = (0,2], and all f e h}. If we have M 
Qj's where M is a finite number, we can pick M points in the interval (0, 2ff] 

where changes values. Let these points be 

0 = < \, < X., < ... < X.-, = 2ff. 

old Al 

The points divide I into M subintervals, 

Now the M sets of pairs {p = (Aj, f), all f £ are orthogonal to each other, since 

the inner products between ary two pairs, one from each set, by definition is 



for any f, h £ H, i j. 

Furthermore, these M sots of pairs span H. Individually, each of these sets 
inchidei 3lements of the form (A., f) for all f £ H, so each has at most dimensionality 
equal to dim {h}. Hence we have 

M 

dim {h"^} « 2 dim {h} = M dim {h|, 
i=l 

which is Theorem 5. 

DISCUSSION. We now consider the interval A. that contains the point \^, F^ = 

Q.. We can show that the dimensionality of the subspace spanned by the set {(A^, f), all 

f £ h} is equal to dim Let S. be the range space of Q.. For any vector f orthog- 

onal to all elements in S., the square of the length of the vector (A^, f) is 

{(Aj, f), (A^, f)} = (F^^f, f) = (Q^f, f) = 0. 

Hence for all f i S., (A., f) = 0 is a trivial zero element. Whereas for g £ Sj^, 
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{(A., g), (A., g)} = (Qjg, g) > 0 

because g is in the range space of Q^. Therefore 
dirr. {(A., f). all f € h} = dim {(R{Qj}} 

and 


dim {H7 = Z dim {(A., f), all f € h} 
i=l ^ 

M 

= S dim 
i=l ^ 

This condition satisfies the minimality condition given by Theorem 7 . Hence the 
construction in Naimark's theorem (Theorem 2) gives the minimal extension./ 
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APPENDIX J 


Sequen tial Detection of Signals Transmitted by a Quantum 
System (Equlprobable Binary Pure State) 


Suppose we want to transmit a binary signal with a quantum system S that is not cor- 
rupted by noise (see Chan^^). The system is in state [ s^> when digit "0" is sent, and 
in state | s^ ) when digit ”1" is sent. Let the a priori probabilities that the digits ”0" 
and "1" are sent each be equal to one-half. The performance of detection is given by 
the probability of error. We try to consider the performance of a sequential detection 
scheme by bringing an apparatus A to interact with the system S and then performing 
a measurement on S and then on A, or vice versa. The structure of the second mea- 
surement is optimized as a consequence of the outcome of the first measurement. In 
Section VIII we considered the case in which the joint state of S and A can be factored 
into the tensor product of a state in S and a state in A. In general, the joint state of S 
and A does not factor, and we now wish to treat this general case. 

Let the initial state of A before interaction be 1 a^). If digit "0" is sent, the joint 
state of S+A before interaction is |s^)[a^). If digit "1" is sent, the state is Is, )la^). 

The interaction between S and A can be characterized by a unitary transformation U 
on the joint state of S+A. 

l®o + 4» = ^l®o>l^o> 


!85+a^j» = ulsj>|a^>. 


By symmetry of the equiprobability of digits "1" and "0", we select a measurement 
on A characterized by the self-adjoint operator such that the probability that it will 
decide a "0", given that "0" is sent, is equal to the probability that it will decide on "1", 
given that " 1 " is sent. Let | <1)^ > and | <|>j > be its eigenstates . Then {| <[). j ^ 2 


the Hilbert space, ^ arbitrary orthonormal basis in the Hilbert 

space, 3Cg. Then the orthonormal set 2 is a complete orthonormal basis 

for the tensor product Hilbert apace ® 3Cg. ^ 


Then 



S 

i=1.2 

3 = 1.2 

S 

i=l. 2 
j=l,2 


byUpl+j). 


where a^j and b^^ are complex numbers. Since unitary transformations preserve inner 
products. 
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s 

i=l,2 

j=1.2 


b. .a. . 

13 1 ] 


= <s,|So>. 

If we perform the measurement charaoterlaed by O^. the probabiUtlea that we shall 
find A in state U„> and | + ,>. Biven that digit "i" or digit -O” Is sent, are 

Pr[U„>|o]= f Ibojl^ 


Pr[|v!0=.J^Jb„3!^ 


Pr[|<t>i)|l]= . 2 : ibijl^- 

j=l I 2 

But by symmetry we choose Pr[j 4 >q>I o] = Pr[l bj 3 ] 


Pr[Ui>lo] = Pr[l<i>i>l !]♦ 

Given as a result of the measurement that we find system A to be in state I 4 . 0 ). we 
wish to update the a priori probabilities 01 digits "I" and -O". Using Bayes' rule, we 

obtain 


Pr[ol |<j.Q>] = 


Pr[l<l>Q)l0]F^^t0l 


Pr[0]=| 

Pr[Uo)l = 

= Y <!>o> 1 ®1+ 1 ^ I 


J_ 

" 2 

/. Pr[0lUo>]= Pr[|<t>o)|0] 


= S 
j=U2 



2 


Pr[l| l<|>o>] 


S 

3=1.2 



2 


S 

3=1.2 
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a«aejr«awet3nw>ii=-.t 


is senti 


Given that the outcome is U^). the system S is now in well-defined states. If "0" 




Ui) 




j=l»2 




If "1" is senti 


S b .U.) 
f. j=1.2 
|si> = 


y.z 

After the measurement on A we have a new set of a priori probabilities and a new 
set of states for system S. We choose a measurement on S characterized by the self- 
adjoint operator Og such that the performance is optimum. From previous calculations 
the probability of error, given as a result of the first measurement, is 

Pr[£| k„>] 4 |l - [l -4Pr[0| U„»Pr[l| 




I S b'a ,|2 
j=l,2 




By symmetry 


>-[,- 41 , 

Minimizing Pr[c ], subject to the inner product constraint, 2 b’*'a =^sls\ vi^lrie 

i=l,2 yieios 

. , j=l*2 

- a/' -I<=iIs„)I^ • 

This is the same result that was derived for the case when the joint state of S+A can be 
factored into the tensor product of states in S and A, 
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APPENDIX K 


Proof of Theorem 14 


Theorem 14 

If an operator-valued measure is defined on a finite index set , with 

values as operators in a finite dimensional Hilbert space 3C (dim {3C} = N}, and 
the measures {Q.} pairwise commute, then it can always be realized by a sequential 
measiurement characterized by a tree with self-adjoint measurements at each vertex. 
In particular, if M < N, the sequential measurement can be characterized by a tree of 
length 2. In general, the minimum length of the tree required is the smallest inte- 
ger JL such that 

log M 


Proof; 

(1) Let us prove the case for M = N. Note that the case M < N can be made to cor- 
respond to M = N by defining 


Q. = 0 


for i = M+1, . . . , N. 


So is an operator -valued measure and S = Ijg* 

Since the Q^. pairwise commute, on a finite dimensional Hilbert space 3C they can 
be diagonalized simultaneously by a set of complete orthonormal eigenvectors {|b.)}^j, 
where N is a finite integer (equals dim {3C}). That is, 

N 


N 


Qi = .2 qjib.) (b.|. M 


with <lj ^ 0. foi* all i, j, and 


i 

Z qU 1, 

i=l J 




(K. 1) 


(|V<b.|)(lb.,)<b.,|}=6..,|bp<b.|, \Aj.j-. 

Let us perform a self-adjoint measurement on the system characterized by the 
projector-valued measure 


The possible outcomes can be modeled by the N branches of the tree of length 1, 
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as shown in Pig, K. 1. 

Suppose the outcome of the first measurement is b^. Let a second self-adjoint 
measurement be performed. Let the projector- valued measure for this measurement 
be {p^ s [c?) where is a complete orthonormal basis of 3C. The N 

possible outcomes of the second measurement can be modeled by the N branches of the 
' subtree* in Fig. K. 2. 



Figure K. 2 


B> the results in Section IX, the operator-valued measure R.. for iach path (i. e. , 
each path (a^, b^. cj) for all i, j) is given by 

R .. = n.pk . 

j 1 3 

= IV <‘’jl‘^i> <c|lb.) <b.j 

= (K.2] 

Let be an arbitrary complete orthonormal basis, and let 

IV = 1, 


no 


y 


1 


From Eq. K. 1, 


(bJbj) = S 1. 

J J J 


Then 



>/i. 


But since | b^) and | b^) are both unit norm vectors, 
tion Uj (that is not unique) such that 


there exists a unitary transforma- 



So It we choose the second self-adjoint measurement such that 

Wl, 

the operator-valued measure for the path (a^, b., c|). from Eq. K. 2 

= “51V <^i- 


Let us perform such second measurement on all outcomes b.. and identity each out- 
come 1 m the inden set of operator-valued measure fo no - - 
Of all paths ,a„.h,cj,,jv 1 endh.. in thT: r£ 

scrip, i. Then the operator-valued measure of the sum of ai; L th;;; iTls 


N 

2 

j=l 


R.. 


i?i V “‘^i* 


The sequential measurement can then be characterized by the tree in Fig. k. 3. Hence 

{Q generalized measurement given by the operator- valued measure 

^^i-*ial n sequential measurement. 


Ill 



Figure K. 3 


(ii) Let us prove the case for M > N. The method of constructing the sequential 
measurement is similar to the case M ^ N, except that in general the sequential mea** 
surement must have more than two steps. Let ^ operator- valued mea- 

sures such that they pairwise commute and M > N = dim {JC}. 

Since they commute, they can be diagonalized simultaneously by a complete ortho- 
normal basis {|b^)}^p such that 



As in part (i), let us first perform the self-adjoint measurement corresponding to 
the projector-valued measures = i ) (b^| so that the initial part of the tree 

characterizing the sequential measurement is given by Fig. K. 1. ^ 

For each of the N one- dimensional subspaces spanned by the N vectors {j 
we can define a resolution of the identity given by the Q^, since 

M . 

= the identity operator of the j one-dimensional subspace spanned 

by I bj). 
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So the set of one-dimensional positive operators {"Ijlbj) ® resolution of the 

identity. Whenever any of these equals zero, we can delete them from the res- 

olution of the identity without loss of generality. If the number of nonzero for some j 
is smaller than N = dim {3C}, it is obvious that we can perform a second self-adjoint 
measurement at those vertices in exactly the same fashion as in the proof of part (i), 
and we proceed accordingly. The problem is when the number of nonzero qj exceeds 
the number N = dim {3C}. By Theorem 6, an extended space of dimensionality equal to 
the number of nonzero qj is required. Certainly the original Hilbert space with less 
dimensions will not suffice. Let the number of nonzero q^ be Mj so that N < Mj M. 

We group the set of Mj positive operators {q^|b^) (hj|} into N subsets (groups), since 
we want each subset to have as few members as possible. We try to group the oper- 
ators as evenly and optimally as possible; hence, the minimum for the maximum number 
in each of these N subsets is given by the smallest integer Nj such that NNj ^ M^. We 
can indicate the partition symbolically by Fig. K. 4. 



Partition Into 
N subsets 
Moximum number 
of members in 
each subset is 
N|. 


For each of these N subsets, if we sum the operators within the subset, we get a 
single one -dimensional operator. Then the N resulting one- dimensional operators (one 

from each subset) form a resolution of the 
identity that has a projector-valued exten- 
sion on an N-dimensional space. Thus it is 
possible to perform a second self-adjoint 
measurement exactly like that in part (i) 
(indicated by Fig. K. 2) to ' separate' these 
N subsets of outcomes. The process is 
indicated symbolically in Fig. K. 5. 

each of 



c| subset {l} 
Cg subset {2} 

C| subset {i} 
c|| subset {n} 


If N. < N we can 'separate' 

J 

the subset of members into their indi- 
vidual members by performing a third 
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measurement. The nature of this measurement is exactly ansilogous to that of the second 
measurement, the construction of which is given in part (i). Then we can identify the 
measures {Q*} fcy summing the measures for the appropriate paths as in part (i). But 
the tree now has length 3 instead of Z, 

If Nj > N we have to 'separate' each subset that has more thsin N members into N 
finer subsets, and this can be done by a reiteration of the procedure that has been 
described. This 'separation' process is repeated (by measuring a sequence of self- 
adjoint measurements) until the number of members in each subset is less than N. Then 
the final measurement corresponding to the second measurement of part (i) is per- 
formed, and the measures are identified by sunaming over the measures of the 
appropriate paths. 

This construction demonstrates that if 0 < M ^N, we only need a tree of length 2. 

2 

For N < M ^ N we need a tree of length 3. In general the minimal length of the tree 

log M 

that is required is the smallest integer £ such that £ 3= 1 + « / 
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APPENDIX L 


Extension of Theorem 14 

When the Hilbert space is infinite dimensional but separable! Theorem 14 can be 
extended to handle the situation. We shall sketch how this theorem can be generalized. 

Since the operator- valued measures (still defined on a finite index set) pairwise com- 
mute! they can be diagonalized simultaneously. It is then possible to find an infinite 
number of finite dimensional orthogonal subspaces of 3C such that if 

corresponds to the projection operator on these subspaces . then 

with 

Given this decomposition, we can separate the sequential measurement into an infi- 
nite number of steps. For example! we can separate the resolution of the identity in 
the first subspace from the rest of the subspaces by performing a first measurement 
corresponding to the binary projector-valued measure P^ and Ijg •* Pj as in Fig. L. 1. If 



Figure L. 1 


the outcome is in the vertex corresponding to Sj, we can make use of the construction 
In Theorem 14 to 'separate' the measures further by sequential measurements. If the 
outcome is in the other vertex, we can devise a second measurement (just like the first 
one) to separate S 2 from the rest of the subspaces. Eventually, we should be able to 
'separate' the whole space K. although we may have to use a sequential measurement 
with infinite length. But with a judicious choice of subspaces we can guarantee 
that with probability close to one. that the measurement will terminate after a finite 
number of steps. This fact will become apparent after Section XII. 

There is still another way to construct a sequential measurement for the infinite 
dimensional case. If we are willing to perform a self-adjoint measurement that has an 
infinite number of possible outcomes, by the first measurement v.'e can immediately 
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separate the measures into one -dimensional subspaces as in Theorem 14. Now there 
will be an infinite number of second-level vertices. But because of von Neumann's pro- 
jection postulate only one of these vertices will be the outcome and that is all we have 
to deal with in the second measurement. This will enable us to guarantee that for all 
possible situations the sequential measurement will have a finite number of steps. 

When the operator -valued measure is defined on Ein infinite index set, the situation 
will not be different from the first index set case, except that there will be an infinite 
number of outcomes at the final measurement of each path (instead of a finite number). 
Hence we have the general result, which is stated in Section IX as Theorem 15. 
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APPENDIX M 


Procedure to Find a ' Finest* Decomposition of the Hilbert Space 3C 
into Simultaneous Invariant Subspaces 

The main statement that it is possible to find a unique finest set of simultaneous 
invariant subspaces that are pairwise orthogonal is given in Theorem 18. We shall pre- 
scribe a construction procedure to find the finest simultaneous invariant subspaces of a 
set of bounded self-adjoint operators 

DEFINITION. A partially ordered system (S, <) is a nonempty set S, together with 
a relation ^ on S such that 

(a) if a b and b ^ c, then a < c 

(b) a < a. / 

The < relation is called an order relation in S. 

DEFINITION. If B is a subset of a partiaU.y ordered system (S. then an ele- 
ment X in S is said to be a lower bound if every y S B has the property x ^y. A lower 
bound X for B is said to be a greatest lower bound if every lower bound z of B has the 
property z ^x./ 

A similar definition can be given for the least upper bound. 

DEFINITION. A partially ordered set S is a lattice if every pair x, y S S has a least 
upper bound and a greatest lower bound, denoted by x V y. and x A y. respectively. The 
lattice S has a unU if there exists an eleraent 1 s ich that x ^ 1, for all x e S, and a zero 
if there exists an element 0 such that 0 < x, for all x e S. The lattice is called distrib- 
utive if 

X A (y V z) = (X A y) V (x A z), x, y, z € S, 
and complemented if for every x in S, there exists an x' in S such that 
X V x' = 1, 

X A x' = 0./ 

DEFINITION. A Boolean algebra is a lattice with unit and zero which is distributive 
and complemented. / 

For example, the family of all subsets of a set S with inclusion as order relation is 
a Boolean algebra (see Dunford and Schwartz^^). If A, B are subsets of S, A « B if and 
only if A C B, The unit element is S, and the zero is 0 , the empty set. 

AABsAflB. AVBsaUb. 

We have noted that every bounded self-adjoint operator has a unique resolution of the 
identity, which defines a projector-valued measure on the Borel measurable sets of the 
real line. Furthermore, the projector-valued measures of any two Borel sets commute. 
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Consider then the family of projection operators {Pplptrg that are measures of all Borel 
measurable sets p on the real line <R. If we define the relations 

(i) PjPg = implies the order relation Pj ^ ^2 

(ii) PiAPg-PjPg 

(iii) P, VP2 = Pi+P2-^I^2 


for every pair of projection operators in this familyr then this famil;;.’ of projectors fo: nis 
a Boolean algebra. If we consider tlie subspaces of the Hilbert space 3C that 

are the range spaces of this family of commuting projectors {PpIp^B and define the rela- 
tions 


(i) 

(li) 

(iii) 


S„ ‘S S_ if Sp. C s„ (partial order by inclusion) 

Pi Pz Pi Pz f 

So V Sp, = least subspace of 3C that contains So » So 

Pi Pz Pi Pz 

So A So = greatest subspace of JC contained in botli> 

Pi Pz 


then the system {{Splp^gi is a Boolean algebra. 

Consider for each bounded self-adjoint operator T^. a S A. the corresponding Boolean 
algebra of sub spaces 


Each of the subspace S^ is an invariant subspae^ of T^. To find the simultaneous invari- 
ant subspace of the set we find in some sense the intersection of all Boolean 

algebras of subspaces. Specifically we form the family of all subspaces such 

that 


S 


\ 


A s; 

oeA P 


a 


for all possible combinations of the {p^}- 

The family of subspaces have corresponding projection operators that pair- 

wise commute and. in fact, {{s^. c} is a Boolean algebra (the proof is simple but tedious 
and is omitted) . 

To find the finest decomposition of 3C into the subspaces where N can be a 

finite integer or the countable infinity we single out the subspaces {s.} 
so that the null space {o} is the only subspace in the algebra {s^} that is included in each 
of the subspaces S.. This is possible because {{s^, is a lattice that has a partial 
ordering. If the null space {o} is deleted, each of the subspaces S. is a 'local' greatest 
lower bound, for a total-ordered subalgebra of {s^. We may view as the 'atoms' 

of the measure space {5C, {s^}, |i}, where |it is the dimensional counting measure, defined 
as n(S ) = dim {s } = Tr {p^ }. A set S. £ {s } is called an atom if ^(S.) * 0 and, if S = 

G C *3 1C 1C 

G 

Sj, then either ^l(S^) = (ji(Sj) or |i(Sg) = 0.) 
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It can be shown that S., i=l, . . . , N are pairwise orthogonal subspaces. That is 

Si Sj ^ S. - JC or Pg^ S Ijg. Since by definition each of tti 

S is invariant for all « e A. the set {s.}N ^ i, simultaneously invariant for all of 
the T^. Furthermore, it is unique. Hence we have Theorem 18 . 

There is an abnormal situation when all of the T^ has a simultaneous degenerate 
eigenspace S. such that every subspace of S. is also a simultaneous invariant subspace. 

e construction that is provided here wiU only single out the unique S., but it does not 
urther decompose S. into finer subspaces. The finer decomposition (which is never 
unique) IS unnecessary because this case is unimportant in communication problems. 

It corresponds to a measurement that first resolves the subspace S. and is followed by 
a randomized strategy that we know cannot improve performance. ^ 


APPENDIX N 


Proof of Theorem 2 1 

For the statement of Theorem 21 see Section XII. The proof is in four parts. 
Proof ; The mean- square error 1^ is 


Ij = /g/ Tr{p^G^,}|a-a'|%(c) d^a'd^a. 


(N.l) 


We shall attempt to show that there is a self-adjoint measurement characterized by the 
projector-valued measure such that when the measurement is used the output 

will be one of the M finite number of discrete points {a^}i and have a mean- square error 

^2 " ^ Tr {p n p(c) d^o, (N. 2) 

i=l i 


with |lj^“l 2 l £■ 


The general philosophy of the proof hinges on the fact that the integral Ij in Eq. N.l 
can be approximated by discrete sums over the index set of a and a', with arbitrary 
accuracy, in the sense of a Riemann sum. With this transition the problem becomes 
a 'pseudo-detection' problem, and Theorem 20 applies. 


I ] Z 

Part (i). The function |a-a'j is continuous on compact set S; hence, it is also 
uniformly continuous on S. By assumption G^t is uniformly continuous. Therefore the 
integrand in Eq. N. 1 is also uniformly continuous. 

Let 


/q / P(«) d^ad^o' = / d^tt' = K < 00, 


(N.3) 


since S is compact. For an ^ > 0. there exists a 6^ > 0 such that for all o', a" e S and 


|o'-a"| < 6^, 


! Tr {p^G^ „} ! a- 0 " I 2 - Tr {p_^G^ ,} I a-o ' I 2} I < ^ • 

Define the neighborhoods for all a e S: 

(a) s {o' ! |c-a' I < 6^}. 


(N.4) 


(N.5) 


Than the set of open sets {v^ ^ open cover of S and, since S is compact. 


there exists a finite subcover {v, (o.)}._, such that 

0 j 1 1” <l 

M 

U V, (a ) = s. 

i^ ^1 ^ 

The sets {Vg (a.)} are not disjoint, but we can form disjoint subsets {v. (a.)} from them 
1 ° I ^ 

by arbitrarily assigning the overlapping parts to one of the sets, so that 


(N.6) 
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for i j 


V (a ) n (a ) = 0, 

1 "l ^ 


and 


M, 


U V (a ) = S. 
i 5 l ^ 


(N.7) 


Let 




(N.8) 


Define 


M, 


^ Tr {p Q , }[o-ol,i^ p(a) d^a. 

3 b c o., 1 

r 1 

4VS Tr {p^Q^j }|a-tt:,[^lp(a) 


(N. 9 ) 


^l '^3 = 


d a 


M, 


Tr 


^ Tr {p Q , }|a-a' 

u u 1^=1 ^ ^ 


p(o) d^a 


^ ’ P(“) =-|-- 


(N. 10) 


^ 4 K 1^'“' ^ 

The last inequality is implied by Eq. N. 4 . 

Part (ii). Similarly, since p and |a-a'|^ are both uniformly continuous on S, given 


.M, 


atiy4 ^ O' there exists a 62 > 0 such that if we form the sets {v^ have 


I3-I4I <t- 


(N. 11 ) 


where is defined as 


M2 


‘4 ' jf, j,= T-- P --{^62 <“!>}■ 


where 


Pr{v (a^}= ;a jp(a)d2a. 


'1 ' 


(N.12) 


Note that we can use the same neighborhood as in part (i) by forming neighborhoods of 
size 6 = min (6^, 62) ®*^tl use the same set of Then becomes 

MM 9 A 

‘4 = ,?i v-i'l 


(N. 13 ) 


Part (iii) . Observe that looks like the probability of error expression for the 
M-ary detection problem with a slightly different cost function. By the method used in 
Theorems 19 and 20. it can be shown that there exists a projector-valued measure 
{n^ such that 



(N. 14) 


where 


M M 


I = S S Tr{p n }[ o.-a., r Pr{v, (a.)}. 

5 i=i i.= i «i ^2 1 


Part (iv). If we use the self-adjoint operator characterized by the projector- valued 
measure {ll^ as measurement, the mean-square error is 


IVl _ I I o p 

h ~ P(«) d a. 


(N.15) 


But le is a Riemann sum of the integral I,, and with small enough partition size 6 for 
the V^(a^)» we have 


Iv's' rt- 


(N. 16) 


From part (iii). 




From part (ii). 


I4-I3I 


From part (i). 



(N.17) 


(N. 18) 


(N. 19) 
(N.20) 
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appendix o 

Proof of Theorem 22 


Theorem 22 

Two generalized measurements, characterized by the operator-valued measures 
{s.}.j_y,{T.}.^y , are simultaneously measurable if and only if there is a third gener 
aiized measurement, so characterized by the measure 


(i) S, = 2 Q- 

" iSK, 


and disjoint subsets 


and 


U K, = K. 
iey 


(ii) 


T. = 
3 


2 Qk> 
jSKI 


> 0^3 £ / 


and for disjoint subsets {Kj}jg ^ of K so that 

U K1 = K. / 

3V ^ 


Proof: 

(i) Necessity. H are simultaMously measurable, chare emsts on 

an extended space 3C'*' = K, two commuting projector -valued measures {njjgy. 


{p.}.._ such that 
3^3^/ 


^3C^i^3C’ 




'^j ■ ^5C^j%’ 

Since {n.},{P.} are simultaneously measurable, there exists a third projector-valued 
measure 

(a) n. = 2 (R^, 

^ i£K. 

and disjoint subsets of K, so that 


Vje J 


(b) P. = 2 (R. , 

^ j€K! ^ 


and disjoint subsets {Kj}jg^ofK, so that 

U K' = K. 

je/ ^ 


Therefore 


®i ^K^i^K 


iGK. 


= 2 Ql,. 

i£K. 


Similarly, 

P. = 2 Qj^, 

where is defined as Pac^kPjc’ ^ generality we can form all 

possible products of the form 


Then 


”ij 

1 3 

n. = 

2 (R.. 

1 

je/ 

p, = 

2 (R.., 

ley 

3 


which gives 


S = 2 Q.. 

j^/ ^ 

T = 2 Q.., 

J ieJ 


where 


Q.. — Ptp^4-;^TO* 

3C 13 Jt 

Hence the condition given in the theorem is necessary. 
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(ii) Sufficiency. Let ® projector-valued 

valued measure Then the two projector-valued 


n = s (R 

^ kSKj * 


extension for the operator- 
measures defined as 


P. = 
3 


kSK! 

3 


«k 


commute and are simultaneously measurable, 
is sufficient. / 


Hence the condition given in the theorem 
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APPENDIX P 


Construction for Operator -Valued Measure {Q^jI 


Problem 


Given two simultaneous measurable operator- valued measures ^ 

r 1 i iS*^ J 

want to find a third measure such that 


, we 


Sj= S Q 
X jS/ ^3 

"^3 " i|> 


Wi e c/ 

M3 e /•/ 


Construction 

To find Qj ,* in some sense we would like to find the 'biggest' possible operator Q, , 

A ^ ^ A * 1 

such that Sj = Sj “Qjj> and Tj ~ Tj ~Qjj are still nonnegative-definite. (An operator A 
is bigger than the operator B, A ^ B if and only if A - B ^ 0. The order relation 5= pro- 
vides a partial ordering and Q, , is the maximal element.) Since S, = S Q, . is a mea- 
- * * ^ i>l *3 

sure and should be positive! so is T, . ■* 

^ /\ * 

Sj - Tj = - Tj^ is a boimded self-adjoint operator; therefore! by the spectral 

theorem for bounded self-adjoint operators! there exists a spectral measure {Ej^} such 
that Sj - Tj = §j- - Tj = \dEj^. Hence ®nd Tj = \dEj^, so that 

Qj,=S,-S, = S,-/J 

= T, - T, = T, * ;“i XdE^. 

Now that we have a basic construction for it is possible to generalize by induction 
to find any arbitrary Q.j. Suppose we are given for all i < i'. j < j*! and we desire 
to find the Q.|j, operator. 

Define 

s:. HS,,- £ Q,„ 


j<j' 




T!, = T„- £ 


i<i 


• xj 


Then Q^.^, is the biggest operator such that Sj, ~ Q,,j, ^ 0 and Tj, - Q^.j, 0! and it can 

be obtained by the previous procedure for Qjj- By induction, all of the can be 

found. 
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APPENDIX Q 
Stone's Theorem 


The statement of this theorem is taken from F. Riesz and B. Sz.-Nagy. 


STONE'S THEOREM. Every one-parameter group {u^| (-«<t< «) of unitary trans 
formations for which (U^f. g) is a continuous function of t, for all elements f and g, 
admits the spectral representation 


Where {Ej^} is a spectral family such that w {u^}. 

The proof is due to Sz.-Nagy^® but it was preceded by other proofs. 
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